Specification and Simulation of Statistical Query Algorithms for Efficiency and Noise Tolerance
โ Scribed by Javed A Aslam; Scott E Decatur
- Book ID
- 102971726
- Publisher
- Elsevier Science
- Year
- 1998
- Tongue
- English
- Weight
- 613 KB
- Volume
- 56
- Category
- Article
- ISSN
- 0022-0000
No coin nor oath required. For personal study only.
โฆ Synopsis
A recent innovation in computational learning theory is the statistical query (SQ) model. The advantage of specifying learning algorithms in this model is that SQ algorithms can be simulated in the probably approximately correct (PAC) model, both in the absence and in the presence of noise. However, simulations of SQ algorithms in the PAC model have non-optimal time and sample complexities. In this paper, we introduce a new method for specifying statistical query algorithms based on a type of relative error and provide simulations in the noisefree and noise-tolerant PAC models which yield more efficient algorithms. Requests for estimates of statistics in this new model take the following form: ``Return an estimate of the statistic within a 1+ factor, or return =, promising that the statistic is less than %.'' In addition to showing that this is a very natural language for specifying learning algorithms, we also show that this new specification is polynomially equivalent to standard SQ, and thus, known learnability and hardness results for statistical query learning are preserved.
We then give highly efficient PAC simulations of relative error SQ algorithms. We show that the learning algorithms obtained by simulating efficient relative error SQ algorithms both in the absence of noise and in the presence of malicious noise have roughly optimal sample complexity. We also show that the simulation of efficient relative error SQ algorithms in the presence of classification noise yields learning algorithms at least as efficient as those obtained through standard methods, and in some cases improved, roughly optimal results are achieved. The sample complexities for all of these simulations are based on the d & metric, which is a type of relative error metric useful for quantities which are small or even zero. We show that uniform convergence with respect to the d & metric yields ``uniform convergence'' with respect to (+, %) accuracy.
Finally, while we show that many specific learning algorithms can be written as highly efficient relative error SQ algorithms, we also show, in fact, that all SQ algorithms can be written efficiently by proving general upper bounds on the complexity of (+, %) queries as a function of the accuracy parameter =. As a consequence of this result, we give general upper bounds on the complexity of learning algorithms achieved through the use of relative error SQ algorithms and the simulations described above.
๐ SIMILAR VOLUMES
We describe two new parallel algorithms, one conservative and another optimistic, for discrete-event simulation on an exclusiveread exclusive-write parallel random-access machine (EREW PRAM). The target physical systems are bounded degree networks which are represented by logic circuits. Employing \