✦ LIBER ✦

Specification and Simulation of Statistical Query Algorithms for Efficiency and Noise Tolerance

✍ Scribed by Javed A Aslam; Scott E Decatur

Book ID: 102971726
Publisher: Elsevier Science
Year: 1998
Tongue: English
Weight: 613 KB
Volume: 56
Category: Article
ISSN: 0022-0000
DOI: 10.1006/jcss.1997.1558

No coin nor oath required. For personal study only.

✦ Synopsis

A recent innovation in computational learning theory is the statistical query (SQ) model. The advantage of specifying learning algorithms in this model is that SQ algorithms can be simulated in the probably approximately correct (PAC) model, both in the absence and in the presence of noise. However, simulations of SQ algorithms in the PAC model have non-optimal time and sample complexities. In this paper, we introduce a new method for specifying statistical query algorithms based on a type of relative error and provide simulations in the noisefree and noise-tolerant PAC models which yield more efficient algorithms. Requests for estimates of statistics in this new model take the following form: ``Return an estimate of the statistic within a 1+ factor, or return =, promising that the statistic is less than %.'' In addition to showing that this is a very natural language for specifying learning algorithms, we also show that this new specification is polynomially equivalent to standard SQ, and thus, known learnability and hardness results for statistical query learning are preserved.

We then give highly efficient PAC simulations of relative error SQ algorithms. We show that the learning algorithms obtained by simulating efficient relative error SQ algorithms both in the absence of noise and in the presence of malicious noise have roughly optimal sample complexity. We also show that the simulation of efficient relative error SQ algorithms in the presence of classification noise yields learning algorithms at least as efficient as those obtained through standard methods, and in some cases improved, roughly optimal results are achieved. The sample complexities for all of these simulations are based on the d & metric, which is a type of relative error metric useful for quantities which are small or even zero. We show that uniform convergence with respect to the d & metric yields ``uniform convergence'' with respect to (+, %) accuracy.

Finally, while we show that many specific learning algorithms can be written as highly efficient relative error SQ algorithms, we also show, in fact, that all SQ algorithms can be written efficiently by proving general upper bounds on the complexity of (+, %) queries as a function of the accuracy parameter =. As a consequence of this result, we give general upper bounds on the complexity of learning algorithms achieved through the use of relative error SQ algorithms and the simulations described above.

📜 SIMILAR VOLUMES

On the Efficiency of Noise-Tolerant PAC

On the Efficiency of Noise-Tolerant PAC Algorithms Derived from Statistical Queries

✍ Jeffrey Jackson 📂 Article 📅 2003 🏛 Springer Netherlands 🌐 English ⚖ 173 KB

Algorithms for exact and approximate sta

Algorithms for exact and approximate statistical simulation of Poisson ensembles

✍ T. A. Averina; G. A. Mikhailov 📂 Article 📅 2010 🏛 SP MAIK Nauka/Interperiodica 🌐 English ⚖ 274 KB

Efficient and Scalable PRAM Algorithms f

Efficient and Scalable PRAM Algorithms for Discrete-Event Simulation of Bounded Degree Networks

✍ S. Prasad 📂 Article 📅 1993 🏛 Elsevier Science 🌐 English ⚖ 603 KB

We describe two new parallel algorithms, one conservative and another optimistic, for discrete-event simulation on an exclusiveread exclusive-write parallel random-access machine (EREW PRAM). The target physical systems are bounded degree networks which are represented by logic circuits. Employing \

Efficiency of the Incomplete Enumeration

Efficiency of the Incomplete Enumeration Algorithm for Monte-Carlo Simulation of Linear and Branched Polymers

✍ Sumedha; Deepak Dhar 📂 Article 📅 2005 🏛 Springer 🌐 English ⚖ 328 KB

A Simulation Study of Joint Uses of Data

A Simulation Study of Joint Uses of Data Envelopment Analysis and Statistical Regressions for Production Function Estimation and Efficiency Evaluation

✍ Indranil R. Bardhan; William W. Cooper; Subal C. Kumbhakar 📂 Article 📅 1998 🏛 Springer 🌐 English ⚖ 180 KB

A study for improving detection efficien

A study for improving detection efficiency of an HPGe detector based gamma spectrometer using Monte Carlo simulation and genetic algorithms

✍ Ngo Quang Huy; Do Quang Binh; Vo Xuan An 📂 Article 📅 2012 🏛 Elsevier Science 🌐 English ⚖ 789 KB