Data analysis is now part of practically every research project in the life sciences. In this book we use data and computer code to teach the necessary statistical concepts and programming skills to become a data analyst. Instead of showing theory first and then applying it to toy examples, we start
Data Analysis for the Life Sciences
β Scribed by Rafael A Irizarry and Michael I Love
- Publisher
- leanpub.com
- Year
- 2021
- Tongue
- English
- Leaves
- 2021
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
The unprecedented advance in digital technology during the second half of the 20th century has produced a measurement revolution that is transforming science. In the life sciences, data analysis is now part of practically every research project. Genomics, in particular, is being driven by new measurement technologies that permit us to observe certain molecular entities for the first time. These observations are leading to discoveries analogous to identifying microorganisms and other breakthroughs permitted by the invention of the microscope. Choice examples of these technologies are microarrays and next generation sequencing. This book will cover several of the statistical concepts and data analytic skills needed to succeed in data-driven life science research. We go from relatively basic concepts related to computing p-values to advanced topics related to analyzing high-throughput data.
While statistics textbooks focus on mathematics, this book focuses on using a computer to perform data analysis. Instead of explaining the mathematics and theory, and then showing examples, we start by stating a practical data-related challenge. This book also includes the computer code that provides a solution to the problem and helps illustrate the concepts behind the solution. By running the code yourself, and seeing data generation and analysis happen live, you will get a better intuition for the concepts, the mathematics, and the theory. The book was created using the R markdown language and we make all this code available to the reade
β¦ Table of Contents
Table of Contents
Acknowledgements
Introduction
Getting Started
Inference
Exploratory Data Analysis
Matrix Algebra
Linear Models
Inference For High Dimensional Data
Statistical Models
Distance and Dimension Reduction
Basic Machine Learning
Batch Effects
π SIMILAR VOLUMES
This book covers several of the statistical concepts and data analytic skills needed to succeed in data-driven life science research. The authors proceed from relatively basic concepts related to computed p-values to advanced topics related to analyzing highthroughput data. They include the R code t
Any practical introduction to statistics in the life sciences requires a focus on applications and computational statistics combined with a reasonable level of mathematical rigor. It must offer the right combination of data examples, statistical theory, and computing required for analysis today. And