Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration Introductory Statistics Using Python is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, co
Statistics for Data Science and Analytics
β Scribed by Peter C. Bruce, Peter Gedeck, Janet Dobbins
- Publisher
- Wiley
- Year
- 2024
- Tongue
- English
- Leaves
- 384
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration
Statistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations.
A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine learning, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of βkitchen sinkβ formulas. Regression is taught both as a tool for explanation and for prediction.
This book is informed by the authorsβ experience designing and teaching both introductory statistics and machine learning at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves.
Statistics for Data Science and Analytics includes information on sample topics such as:
- Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and sets
- Experiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary data
- Specialized Python packages like numpy, scipy, pandas, scikit-learn and statsmodelsβthe workhorses of data scienceβand how to get the most value from them
- Statistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributions
Written by and for data science instructors, Statistics for Data Science and Analytics is an excellent learning resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.
π SIMILAR VOLUMES
Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration Introductory Statistics Using Python is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, co
Dana S. Dunn, author of The Practical Researcher: A Student Guide to Conducting Psychological Research, brings his twelve years of statistics teaching experience to life in the new Statistics and Data Analysis for the Behavioral Sciences. Dr. Dunn combines the quantitative aspects of statistics wit
<p><span>This book brings together the best contributions of the Applied Statistics and Policy Analysis Conference 2019. Written by leading international experts in the field of statistics, data science and policy evaluation. This book explores the theme of effective policy methods through the use o
Dana S. Dunn, author of The Practical Researcher: A Student Guide to Conducting Psychological Research, brings his twelve years of statistics teaching experience to life in the new Statistics and Data Analysis for the Behavioral Sciences. Dr. Dunn combines the quantitative aspects of statistics wit