𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Algorithms for Data Science

✍ Scribed by Brian Steele, John Chandler, Swarna Reddy (auth.)


Publisher
Springer International Publishing
Year
2016
Tongue
English
Leaves
438
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


This textbook on practical data analytics unites fundamental principles, algorithms, and data. Algorithms are the keystone of data analytics and the focal point of this textbook. Clear and intuitive explanations of the mathematical and statistical foundations make the algorithms transparent. But practical data analytics requires more than just the foundations. Problems and data are enormously variable and only the most elementary of algorithms can be used without modification. Programming fluency and experience with real and challenging data is indispensable and so the reader is immersed in Python and R and real data analysis. By the end of the book, the reader will have gained the ability to adapt algorithms to new problems and carry out innovative analyses.
This book has three parts:(a) Data Reduction: Begins with the concepts of data reduction, data maps, and information extraction. The second chapter introduces associative statistics, the mathematical foundation of scalable algorithms and distributed computing. Practical aspects of distributed computing is the subject of the Hadoop and MapReduce chapter.(b) Extracting Information from Data: Linear regression and data visualization are the principal topics of Part II. The authors dedicate a chapter to the critical domain of Healthcare Analytics for an extended example of practical data analytics. The algorithms and analytics will be of much interest to practitioners interested in utilizing the large and unwieldly data sets of the Centers for Disease Control and Prevention's Behavioral Risk Factor Surveillance System.(c) Predictive Analytics Two foundational and widely used algorithms, k-nearest neighbors and naive Bayes, are developed in detail. A chapter is dedicated to forecasting. The last chapter focuses on streaming data and uses publicly accessible data streams originating from the Twitter API and the NASDAQ stock market in the tutorials.
This book is intended for a one- or two-semester course in data analytics for upper-division undergraduate and graduate students in mathematics, statistics, and computer science. The prerequisites are kept low, and students with one or two courses in probability or statistics, an exposure to vectors and matrices, and a programming course will have no difficulty. The core material of every chapter is accessible to all with these prerequisites. The chapters often expand at the close with innovations of interest to practitioners of data science. Each chapter includes exercises of varying levels of difficulty. The text is eminently suitable for self-study and an exceptional resource for practitioners.

✦ Table of Contents


Front Matter....Pages i-xxiii
Introduction....Pages 1-16
Front Matter....Pages 17-17
Data Mapping and Data Dictionaries....Pages 19-50
Scalable Algorithms and Associative Statistics....Pages 51-104
Hadoop and MapReduce....Pages 105-129
Front Matter....Pages 131-131
Data Visualization....Pages 133-159
Linear Regression Methods....Pages 161-215
Healthcare Analytics....Pages 217-251
Cluster Analysis....Pages 253-275
Front Matter....Pages 277-277
k-Nearest Neighbor Prediction Functions....Pages 279-312
The Multinomial NaΓ―ve Bayes Prediction Function....Pages 313-342
Forecasting....Pages 343-379
Real-time Analytics....Pages 381-401
Back Matter....Pages 403-430

✦ Subjects


Data Mining and Knowledge Discovery;Statistics and Computing/Statistics Programs;Mathematics of Computing;Health Informatics


πŸ“œ SIMILAR VOLUMES


Algorithms for Data Science
✍ Brian Steele, John Chandler, Swarna Reddy πŸ“‚ Library πŸ“… 2017 πŸ› Springer 🌐 English

This textbook on practical data analytics unites fundamental principles, algorithms, and data. Algorithms are the keystone of data analytics and the focal point of this textbook. Clear and intuitive explanations of the mathematical and statistical foundations make the algorithms transparent. But pra

Machine Learning Algorithms Popular algo
✍ Giuseppe Bonaccorso πŸ“‚ Library πŸ“… 2018 πŸ› Packt 🌐 English

Machine learning has gained tremendous popularity for its powerful and fast predictions with large datasets. However, the true forces behind its powerful output are the complex algorithms involving substantial statistical analysis that churn large datasets and generate substantial insight. This s

Graph Algorithms for Data Science (MEAP
✍ TomaΕΎ Bratanič πŸ“‚ Library πŸ“… 2023 πŸ› Manning Publications 🌐 English

Graphs are the natural way to understand connected data. This book explores the most important algorithms and techniques for graphs in data science, with practical examples and concrete advice on implementation and deployment. In Graph Algorithms for Data Science you will learn Labeled-property

Graph Algorithms for Data Science MEAP V
✍ Tomaz Bratanic πŸ“‚ Library πŸ“… 2023 πŸ› Manning Publications 🌐 English

Graphs are the natural way to understand connected data. This book explores the most important algorithms and techniques for graphs in data science, with practical examples and concrete advice on implementation and deployment. In Graph Algorithms for Data Science you will learn: Labeled-property

Data Science Algorithms in a Week: Top 7
✍ David Natingga πŸ“‚ Library πŸ“… 2017 πŸ› Packt Publishing 🌐 English

<h4>Key Features</h4><ul><li>Get to know seven algorithms for your data science needs in this concise, insightful guide</li><li>Ensure you’re confident in the basics by learning when and where to use various data science algorithms</li><li>Learn to use machine learning algorithms in a period of just