- Publisher: Chapman and Hall/CRC; 1 edition (August 3, 2018)
- Language: English
- ISBN-10: 1138316431
- ISBN-13: 978-1138316430
Compositional Data Analysis in Practice Is a user-oriented practical guide to the analysis of data with the property of a constant sum, for example percentages adding up to 100%. Compositional data can give misleading results if regular statistical methods are applied, and are best analysed by first transforming them To logarithms of ratios. This book explains how this transformation affects the analysis, results and interpretation of this very special type of data. All aspects of compositional data analysis are considered: visualization, modelling, dimension-reduction, clustering and variable selection, with many Examples in the fields of food science, archaeology, sociology and biochemistry, and a final chapter containing a complete case study using fatty acid compositions in ecology. The applicability of these methods extends to other fields such as linguistics, geochemistry,Marketing, economics and finance.
R Software
The following repository contains data files and R scripts from the book https://github.com/michaelgreenacre/CODAinPractice . The R package easyCODA, which accompanies this book, is available on CRAN -- note that you should have version 0.25 or higher. The latest version of the package will always be available on R-Forge and can be installed from R with this instruction: install.packages("easyCODA", repos="http://R-Forge.R-project.org") .
Review
"...an interesting book, certainly controversial in some respects for scholars in the field. It has a strong data analytic focus and requires some background in multivariate analysis and biplot theory for a good understanding. It overemphasizes links to correspondence analysis at times, but is Very well written and didactically nicely sliced into modules numbering exactly eight pages each. Most examples in the book are reproducible in the R environment. Finally, it will help the analyst to reflect on the use of weights, to the benefit of the analysis of compositional Data."
―Jan Graffelman in the Biometrical Journal , March 2019
The book takes a essential reference as a practical way to evaluate and interpret compositional data across a broad spectrum of disciplines in the life and natural sciences for both academia and industry. The book takes a prescribed approach starting with the definition of compositional data, the Use Of logratios for dimension reduction, clustering and variable selection issues along with several practical examples and a case study. The theory of compositional data analysis and computational aspects are
included as Appendices.
This book can be used at the undergraduate level as part of a course in data analysis. At the graduate level, for research studies, this book is essential in understanding how to collect and interpret compositional data. Using the methods described in this book will help To avoid costly mistakes made from misinterpreting compositional data."
―Professor Eric Grunsky, Department of Earth and Environmental Sciences, University of Waterloo Waterloo, Ontario, Canada
"Clearly the best introduction to compositional data analysis"
―Professor John Bacon-Shone
About the Author
Michael Greenacre is Professor of Statistics at the Universitat Pompeu Fabra, Barcelona, Spain, where he teaches a course, amongst others, on Data Visualization. He has authored and co-edited nine books and 80 journal articles and book chapters, mostly on correspondence analysis , the latest being Correspondence Analysis in Practice (Third Edition) in 2016. He has given short courses in fifteen countries to environmental scientists, sociologists, data scientists and marketing professionals, and has specialized in statistics in ecology and social science