𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Algorithmic High-Dimensional Robust Statistics

✍ Scribed by ILIAS DIAKONIKOLAS; DANIEL KANE


Publisher
Cambridge University Press
Year
2023
Tongue
English
Leaves
301
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Robust statistics is the study of designing estimators that perform well even when the dataset significantly deviates from the idealized modeling assumptions, such as in the presence of model misspecification or adversarial outliers in the dataset. The classical statistical theory, dating back to pioneering works by Tukey and Huber, characterizes the information-theoretic limits of robust estimation for most common problems. A recent line of work in computer science gave the first computationally efficient robust estimators in high dimensions for a range of learning tasks. This reference text for graduate students, researchers, and professionals in machine learning theory, provides an overview of recent developments in algorithmic high-dimensional robust statistics, presenting the underlying ideas in a clear and unified manner, while leveraging new perspectives on the developed techniques to provide streamlined proofs of these results. The most basic and illustrative results are analyzed in each chapter, while more tangential developments are explored in the exercises.

✦ Table of Contents


Introduction to Robust Statistics 1
1.1 Introduction 1
1.2 Contamination Model 2
1.3 Information-Theoretic Limits 9
1.4 One-Dimensional Robust Estimation 12
1.5 Higher-Dimensional Robust Mean Estimation 17
1.6 Connection with Breakdown Point 20
1.7 Exercises 22
1.8 Discussion and Related Work 27
2 Ecient High-Dimensional Robust Mean Estimation 29
2.1 Introduction 29
2.2 Stability and Robust Mean Estimation 32
2.3 The Unknown Convex Programming Method 41
2.4 The Filtering Method 43
2.5 Exercises 54
2.6 Discussion and Related Work 58
3 Algorithmic Refinements in Robust Mean Estimation 61
3.1 Introduction 61
3.2 Near-Optimal Sample Complexity of Stability 62
3.3 Robust Mean Estimation in Near-Linear Time 72
3.4 Robust Mean Estimation with Additive or Subtractive
Corruptions 81
3.5 Robust Estimation via Nonconvex Optimization 88
3.6 Robust Sparse Mean Estimation 91
vii
viii Contents
3.7 Exercises 95
3.8 Discussion and Related Work 98
4 Robust Covariance Estimation 100
4.1 Introduction 100
4.2 Ecient Algorithm for Robust Covariance Estimation 101
4.3 Applications to Concrete Distribution Families 110
4.4 Reduction to Zero-Mean Case 113
4.5 Exercises 115
4.6 Discussion and Related Work 117
5 List-Decodable Learning 119
5.1 Introduction 119
5.2 Information-Theoretic Limits of List-Decodable Learning 125
5.3 Ecient Algorithms for List-Decodable Mean Estimation 133
5.4 Application: Learning Mixture Models 154
5.5 Exercises 161
5.6 Discussion and Related Work 164
6 Robust Estimation via Higher Moments 166
6.1 Introduction 166
6.2 Leveraging Higher-Degree Moments in List-Decodable
Learning 167
6.3 List-Decodable Learning via Variance of Polynomials 170
6.4 List-Decodable Learning via Sum of Squares 179
6.5 Leveraging Higher Moments in Robust Mean Estimation 186
6.6 Clustering Mixture Models via Higher-Degree Moments 190
6.7 Exercises 199
6.8 Discussion and Related Work 201
7 Robust Supervised Learning 204
7.1 Introduction 204
7.2 Outlier-Robust Algorithms for Linear Regression 205
7.3 Robust Learning of Linear Separators 209
7.4 Robust Stochastic Optimization 217
7.5 Exercises 222
7.6 Discussion and Related Work 226
8 Information-Computation Trade-o s in High-Dimensional
Robust Statistics 229
8.1 Introduction 229
8.2 Statistical Query Lower Bounds 230
8.3 Reduction-Based Computational Hardness 252
Contents ix
8.4 Exercises 259
8.5 Discussion and Related Work 263
Appendix Mathematical Background 265
A.1 Martingales 265
A.2 Hermite Polynomials 267
A.3 Probabilistic Inequalities 268
References 271
Index 282


πŸ“œ SIMILAR VOLUMES


Introduction to High-Dimensional Statist
✍ Christophe Giraud πŸ“‚ Library πŸ“… 2021 πŸ› Chapman and Hall/CRC 🌐 English

<p><strong>Praise for the first edition:</strong></p><p>"[This book] succeeds singularly at providing a structured introduction to this active field of research. … it is arguably the most accessible overview yet published of the mathematical ideas and principles that one needs to master to enter the

Introduction to High-Dimensional Statist
✍ Christophe Giraud πŸ“‚ Library πŸ“… 2014 πŸ› Chapman and Hall/CRC 🌐 English

<P>Ever-greater computing technologies have given rise to an exponentially growing volume of data. Today massive data sets (with potentially thousands of variables) play an important role in almost every branch of modern human activity, including networks, finance, and genetics. However, analyzing s

Multivariate Statistical Analysis: A Hig
✍ V. Serdobolskii (auth.) πŸ“‚ Library πŸ“… 2000 πŸ› Springer Netherlands 🌐 English

<p>In the last few decades the accumulation of large amounts of inΒ­ formation in numerous applications. has stimtllated an increased inΒ­ terest in multivariate analysis. Computer technologies allow one to use multi-dimensional and multi-parametric models successfully. At the same time, an interest a

High-Dimensional Statistics A Non-Asympt
✍ Martin J. Wainwright πŸ“‚ Library πŸ“… 2019 πŸ› Cambridge University Press 🌐 English

Recent years have seen an explosion in the volume and variety of data collected in scientific disciplines from astronomy to genetics and industrial settings ranging from Amazon to Uber. This graduate text equips readers in statistics, machine learning, and related fields to understand, apply, and ad

High-dimensional statistics: a non-asymp
✍ Wainwright, Martin J πŸ“‚ Library πŸ“… 2019 πŸ› Cambridge University Press 🌐 English

Recent years have witnessed an explosion in the volume and variety of data collected in all scientific disciplines and industrial settings. Such massive data sets present a number of challenges to researchers in statistics and machine learning. This book provides a self-contained introduction to the