<p class="description">Learn the techniques and math you need to start making sense of your dataAbout This BookEnhance your knowledge of coding with data science theory for practical insight into data science and analysisMore than just a math class, learn how to perform real-world data science tasks
Principles of Data Science
โ Scribed by Ozdemir, Sinan
- Publisher
- Packt Publishing
- Year
- 2016
- Tongue
- English
- Leaves
- 389
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Cover ; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: How to Sound Like a Data Scientist; What is data science?; Basic terminology; Why data science?; Example -- Sigma Technologies; The data science Venn diagram ; The math; Example -- spawner-recruit models; Computer programming; Why Python?; Python practices; Example of basic Python; Domain knowledge; Some more terminology; Data science case studies; Case study -- automating government paper pushing; Fire all humans, right?; Case study -- marketing dollars.;Learn the techniques and math you need to start making sense of your dataAbout This Book Enhance your knowledge of coding with data science theory for practical insight into data science and analysis More than just a math class, learn how to perform real-world data science tasks with R and Python Create actionable insights and transform raw data into tangible valueWho This Book Is ForYou should be fairly well acquainted with basic algebra and should feel comfortable reading snippets of R/Python as well as pseudo code. You should have the urge to learn and apply the techniques put forth in this book on either your own data sets or those provided to you. If you have the basic math skills but want to apply them in data science or you have good programming skills but lack math, then this book is for you. What You Will Learn Get to know the five most important steps of data science Use your data intelligently and learn how to handle it with care Bridge the gap between mathematics and programming Learn about probability, calculus, and how to use statistical models to control and clean your data and drive actionable results Build and evaluate baseline machine learning models Explore the most effective metrics to determine the success of your machine learning models Create data visualizations that communicate actionable insights Read and apply machine learning concepts to your problems and make actual predictionsIn DetailNeed to turn your skills at programming into effective data science skills? Principles of Data Science is created to help you join the dots between mathematics, programming, and business analysis. With this book, you'll feel confident about askingand answeringcomplex and sophisticated questions of your data to move from abstract and raw statistics to actionable ideas. With a unique approach that bridges the gap between mathematics and computer science, this books takes you through the entire data science pipeline. Beginning with cleaning and preparing data, and effective data mining strategies and techniques, you'll move on to build a comprehensive picture of how every piece of the data science puzzle fits together. Learn the fundamentals of computational mathematics and statistics, as well as some pseudocode being used today by data scientists and analysts. You'll get to grips with machine learning, discover the statistical models that help you take control and navigate even the densest datasets, and find out how to create powerful visualizations that communicate what your data means. Style and approachThis is an easy-to-understand and accessible tutorial. It is a step-by-step guide with use cases, examples, and illustrations to get you well-versed with the concepts of data science. Along with explaining the fundamentals, the book will also introduce you to slightly advanced concepts later on and will help you implement these techniques in the real world.
โฆ Table of Contents
Cover
Copyright
Credits
About the Author
About the Reviewers
www.PacktPub.com
Table of Contents
Preface
Chapter 1: How to Sound Like a Data Scientist
What is data science?
Basic terminology
Why data science?
Example --
Sigma Technologies
The data science Venn diagram
The math
Example --
spawner-recruit models
Computer programming
Why Python?
Python practices
Example of basic Python
Domain knowledge
Some more terminology
Data science case studies
Case study --
automating government paper pushing
Fire all humans, right?
Case study --
marketing dollars. Case study --
what's in a job description?Summary
Chapter 2: Types of Data
Flavors of data
Why look at these distinctions?
Structured versus unstructured data
Example of data preprocessing
Word/phrase counts
Presence of certain special characters
Relative length of text
Picking out topics
Quantitative versus qualitative data
Example --
coffee shop data
Example --
world alcohol consumption data
Digging deeper
The road thus far ...
The four levels of data
The nominal level
Mathematical operations allowed
Measures of center
What data is like at the nominal level
The ordinal level. ExamplesMathematical operations allowed
Measures of center
Quick recap and check
The interval level
Example
Mathematical operations allowed
Measures of center
Measures of variation
The ratio level
Examples
Measures of center
Problems with the ratio level
Data is in the eye of the beholder
Summary
Chapter 3: The Five Steps of Data Science
Introduction to Data Science
Overview of the five steps
Ask an interesting question
Obtain the data
Explore the data
Model the data
Communicate and visualize the results
Explore the data
Basic questions for data exploration. Dataset 1 --
YelpDataframes
Series
Exploration tips for qualitative data
Dataset 2 --
titanic
Summary
Chapter 4: Basic Mathematics
Mathematics as a discipline
Basic symbols and terminology
Vectors and matrices
Quick exercises
Answers
Arithmetic symbols
Summation
Proportional
Dot product
Graphs
Logarithms/exponents
Set theory
Linear algebra
Matrix multiplication
How to multiply matrices
Summary
Chapter 5: Impossible or Improbable --
A Gentle Introduction to Probability
Basic definitions
Probability
Bayesian versus Frequentist
Frequentist approach. The law of large numbersCompound events
Conditional probability
The rules of probability
The addition rule
Mutual exclusivity
The multiplication rule
Independence
Complementary events
A bit deeper
Summary
Chapter 6: Advanced Probability
Collectively exhaustive events
Bayesian ideas revisited
Bayes theorem
More applications of Bayes theorem
Example --
Titanic
Example --
medical studies
Random variables
Discrete random variables
Types of discrete random variables
Summary
Chapter 7: Basic Statistics
What are statistics?
How do we obtain and sample data?
Obtaining data.
โฆ Subjects
Data mining;Data structures;Database management;Electronic books
๐ SIMILAR VOLUMES
<p>This book provides readers with a thorough understanding of various research areas within the field of data science. The book introduces readers to various techniques for data acquisition, extraction, and cleaning, data summarizing and modeling, data analysis and communication techniques, data sc
<h4>Key Features</h4><ul><li>Enhance your knowledge of coding with data science theory for practical insight into data science and analysis</li><li>More than just a math class, learn how to perform real-world data science tasks with R and Python</li><li>Create actionable insights and transform raw d
<h4>Key Features</h4><ul><li>Enhance your knowledge of coding with data science theory for practical insight into data science and analysis</li><li>More than just a math class, learn how to perform real-world data science tasks with R and Python</li><li>Create actionable insights and transform raw d
This book introduces the topics of Big Data, data analytics and data science and features the use of open source data. Among the statistical topics described in this book are: data visualization, descriptive measures, probability, probability distributions, the concept of mathematical expectation, c
<p><b>Introduces readers to the principles of managerial statistics and data science, with an emphasis on statistical literacy of business students ย </b>ย </p> <p>Through a statistical perspective, this book introduces readers to the topic of data science, including Big Data, data analytics, and data