<p><span>This groundbreaking book transcends traditional machine learning approaches by introducing information measurement methodologies that revolutionize the field.</span></p><p><span>Stemming from a UC Berkeley seminar on experimental design for machine learning tasks, these techniques aim to ov
Information-Driven Machine Learning : Data Science as an Engineering Discipline
โ Scribed by Gerald Friedland
- Publisher
- Springer International Publishing
- Year
- 2023
- Tongue
- English
- Leaves
- 289
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
This groundbreaking book transcends traditional machine learning approaches by introducing information measurement methodologies that revolutionize the field.
Stemming from a UC Berkeley seminar on experimental design for machine learning tasks, these techniques aim to overcome the 'black box' approach of machine learning by reducing conjectures such as magic numbers (hyper-parameters) or model-type bias. Information-based machine learning enables data quality measurements, a priori task complexity estimations, and reproducible design of data science experiments. The benefits include significant size reduction, increased explainability, and enhanced resilience of models, all contributing to advancing the discipline's robustness and credibility.
While bridging the gap between machine learning and disciplines such as physics, information theory, and computer engineering, this textbook maintains an accessible and comprehensive style, making complex topics digestible for a broad readership. Information-Driven Machine Learning explores the synergistic harmony among these disciplines to enhance our understanding of data science modeling. Instead of solely focusing on the "how," this text provides answers to the "why" questions that permeate the field, shedding light on the underlying principles of machine learning processes and their practical implications. By advocating for systematic methodologies grounded in fundamental principles, this book challenges industry practices that have often evolved from ideologic or profit-driven motivations. It addresses a range of topics, including deep learning, data drift, and MLOps, using fundamental principles such as entropy, capacity, and high dimensionality.
โฆ Table of Contents
Cover
Front Matter
1. Introduction
2. The Automated Scientific Process
3. The (Black Box) Machine Learning Process
4. Information Theory
5. Capacity
6. The Mechanics of Generalization
7. Meta-Math: Exploring the Limits of Modeling
8. Capacity of Neural Networks
9. Neural Network Architectures
10. Capacities of Some Other Machine Learning Methods
11. Data Collection and Preparation
12. Measuring Data Sufficiency
13. Machine Learning Operations
14. Explainability
15. Repeatability and Reproducibility
16. The Curse of Training and the Blessing of High Dimensionality
17. Machine Learning and Society
Back Matter
๐ SIMILAR VOLUMES
Data-driven discovery is revolutionizing the modeling, prediction, and control of complex systems. This textbook brings together machine learning, engineering mathematics, and mathematical physics to integrate modeling and control of dynamical systems with modern methods in data science. It highligh
Data-driven discovery is revolutionizing the modeling, prediction, and control of complex systems. This textbook brings together machine learning, engineering mathematics, and mathematical physics to integrate modeling and control of dynamical systems with modern methods in data science. It highligh
Data Driven Science & Engineering. 2017.