This book is aimed at professional researchers, practitioners, graduate students and teachers in ecology, environmental science and engineering, and in related fields such as oceanography, molecular ecology, agriculture and soil science, who already have a background in general and multivariate statistics and wish to apply this knowledge to their data using the R language, as well as people willing to accompany their disciplinary learning with practical applications. People from other fields (e.g. geology, geography, paleoecology, phylogenetics, anthropology, the social and education sciences, etc.) may also benefit from the materials presented in this book.
The three authors teach numerical ecology, both theoretical and practical, to a wide array of audiences, in regular courses in their Universities and in short courses given around the world. Daniel Borcard is lecturer of Biostatistics and Ecology and researcher in Numerical Ecology at Université de Montréal, Québec, Canada. François Gillet is professor of Community Ecology and Ecological Modelling at Université de Franche-Comté, Besançon, France. Pierre Legendre is professor of Quantitative Biology and Ecology at Université de Montréal, Fellow of the Royal Society of Canada, and ISI Highly Cited Researcher in Ecology/Environment.
"As with previous editions, the authors have produced a leading textbook on regression."
—Journal of the American Statistical Association
A comprehensive and up-to-date introduction to the fundamentals of regression analysis
Introduction to Linear Regression Analysis, Fifth Edition continues to present both the conventional and less common uses of linear regression in today’s cutting-edge scientific research. The authors blend both theory and application to equip readers with an understanding of the basic principles needed to apply regression model-building techniques in various fields of study, including engineering, management, and the health sciences.
Following a general introduction to regression modeling, including typical applications, a host of technical tools are outlined such as basic inference procedures, introductory aspects of model adequacy checking, and polynomial regression models and their variations. The book then discusses how transformations and weighted least squares can be used to resolve problems of model inadequacy and also how to deal with influential observations. The Fifth Edition features numerous newly added topics, including:A chapter on regression analysis of time series data that presents the Durbin-Watson test and other techniques for detecting autocorrelation as well as parameter estimation in time series regression models Regression models with random effects in addition to a discussion on subsampling and the importance of the mixed model Tests on individual regression coefficients and subsets of coefficients Examples of current uses of simple linear regression models and the use of multiple regression models for understanding patient satisfaction data.
In addition to Minitab, SAS, and S-PLUS, the authors have incorporated JMP and the freely available R software to illustrate the discussed techniques and procedures in this new edition. Numerous exercises have been added throughout, allowing readers to test their understanding of the material.
Introduction to Linear Regression Analysis, Fifth Edition is an excellent book for statistics and engineering courses on regression at the upper-undergraduate and graduate levels. The book also serves as a valuable, robust resource for professionals in the fields of engineering, life and biological sciences, and the social sciences.
A host of complex problems face today's earth science community, such as evaluating the supply of remaining non-renewable energy resources, assessing the impact of people on the environment, understanding climate change, and managing the use of water. Proper collection and analysis of data using statistical techniques contributes significantly toward the solution of these problems. Statistics for Earth and Environmental Scientists presents important statistical concepts through data analytic tools and shows readers how to apply them to real-world problems.
The authors present several different statistical approaches to the environmental sciences, including Bayesian and nonparametric methodologies. The book begins with an introduction to types of data, evaluation of data, modeling and estimation, random variation, and sampling—all of which are explored through case studies that use real data from earth science applications. Subsequent chapters focus on principles of modeling and the key methods and techniques for analyzing scientific data, including:
Interval estimation and Methods for analyzinghypothesis testing of means time series data
Most statistical models are introduced by concept and application, given as equations, and then accompanied by heuristic justification rather than a formal proof. Data analysis, model building, and statistical inference are stressed throughout, and readers are encouraged to collect their own data to incorporate into the exercises at the end of each chapter. Most data sets, graphs, and analyses are computed using R, but can be worked with using any statistical computing software. A related website features additional data sets, answers to selected exercises, and R code for the book's examples.
Statistics for Earth and Environmental Scientists is an excellent book for courses on quantitative methods in geology, geography, natural resources, and environmental sciences at the upper-undergraduate and graduate levels. It is also a valuable reference for earth scientists, geologists, hydrologists, and environmental statisticians who collect and analyze data in their everyday work.
Applications of data mining and ‘big data’ increasingly take center stage in our modern, knowledge-driven society, supported by advances in computing power, automated data acquisition, social media development and interactive, linkable internet software. This book presents a coherent, technical introduction to modern statistical learning and analytics, starting from the core foundations of statistics and probability. It includes an overview of probability and statistical distributions, basics of data manipulation and visualization, and the central components of standard statistical inferences. The majority of the text extends beyond these introductory topics, however, to supervised learning in linear regression, generalized linear models, and classification analytics. Finally, unsupervised learning via dimension reduction, cluster analysis, and market basket analysis are introduced.
Extensive examples using actual data (with sample R programming code) are provided, illustrating diverse informatic sources in genomics, biomedicine, ecological remote sensing, astronomy, socioeconomics, marketing, advertising and finance, among many others.
Statistical Data Analytics:Focuses on methods critically used in data mining and statistical informatics. Coherently describes the methods at an introductory level, with extensions to selected intermediate and advanced techniques. Provides informative, technical details for the highlighted methods. Employs the open-source R language as the computational vehicle – along with its burgeoning collection of online packages – to illustrate many of the analyses contained in the book. Concludes each chapter with a range of interesting and challenging homework exercises using actual data from a variety of informatic application areas.
This book will appeal as a classroom or training text to intermediate and advanced undergraduates, and to beginning graduate students, with sufficient background in calculus and matrix algebra. It will also serve as a source-book on the foundations of statistical informatics and data analytics to practitioners who regularly apply statistical learning to their modern data.
This Second Edition addresses matrix algebra that is useful in the statistical analysis of data as well as within statistics as a whole. The material is presented in an explanatory style rather than a formal theorem-proof format and is self-contained. Featuring numerous applied illustrations, numerical examples, and exercises, the book has been updated to include the use of SAS, MATLAB, and R for the execution of matrix computations. In addition, André I. Khuri, who has extensive research and teaching experience in the field, joins this new edition as co-author. The Second Edition also:Contains new coverage on vector spaces and linear transformations and discusses computational aspects of matrices Covers the analysis of balanced linear models using direct products of matrices Analyzes multiresponse linear models where several responses can be of interest Includes extensive use of SAS, MATLAB, and R throughout Contains over 400 examples and exercises to reinforce understanding along with select solutions Includes plentiful new illustrations depicting the importance of geometry as well as historical interludes
Matrix Algebra Useful for Statistics, Second Edition is an ideal textbook for advanced undergraduate and first-year graduate level courses in statistics and other related disciplines. The book is also appropriate as a reference for independent readers who use statistics and wish to improve their knowledge of matrix algebra.
THE LATE SHAYLE R. SEARLE, PHD, was professor emeritus of biometry at Cornell University. He was the author of Linear Models for Unbalanced Data and Linear Models and co-author of Generalized, Linear, and Mixed Models, Second Edition, Matrix Algebra for Applied Economics, and Variance Components, all published by Wiley. Dr. Searle received the Alexander von Humboldt Senior Scientist Award, and he was an honorary fellow of the Royal Society of New Zealand.
ANDRÉ I. KHURI, PHD, is Professor Emeritus of Statistics at the University of Florida. He is the author of Advanced Calculus with Applications in Statistics, Second Edition and co-author of Statistical Tests for Mixed Linear Models, all published by Wiley. Dr. Khuri is a member of numerous academic associations, among them the American Statistical Association and the Institute of Mathematical Statistics.
The book lays out the basic operations involved in the analysis, the underlying hypotheses, aims and points of views. It conveys the message that each step in the calculations has a specific, straightforward meaning and that patterns and processes known by ecologists often find their counterpart in mathematical operations and functions. The first chapters introduce the elementary concepts and operations and relate them to real-world phenomena and problems. Later chapters concentrate on combinations of methods to reveal surprising features in data sets. Showing how to find patterns in time series, how to generate simple dynamic models, how to reveal spatial patterns and related occurrence probability maps.