Papadimitriou introduced several classes of NP search problems based on combinatorial principles which guarantee the existence of solutions to the problems. Many interesting search problems not known to be solvable in polynomial time are contained in these classes, and a number of them are complete
Sample Complexity of Model-Based Search
โ Scribed by Christopher D. Rosin
- Publisher
- Elsevier Science
- Year
- 2000
- Tongue
- English
- Weight
- 194 KB
- Volume
- 60
- Category
- Article
- ISSN
- 0022-0000
No coin nor oath required. For personal study only.
โฆ Synopsis
We consider the problem of searching a domain for points that have a desired property, in the special case where the objective function that determines the properties of points is unknown and must be learned during search. We give a parallel to PAC learning theory that is appropriate for reasoning about the sample complexity of this problem. The learner queries the true objective function at selected points, and uses this information to choose models of the objective function from a given hypothesis class that is known to contain a correct model. These models are used to focus the search on more promising areas of the domain. The goal is to find a point with the desired property in a small number of queries. We define an analog to VC dimension, needle dimension, to be the size of the largest sample in which any single point could have the desired property without the other points' values revealing this information. We give an upper bound on sample complexity that is linear in needle dimension for a natural type of search protocol and a linear lower bound for a class of constrained problems. We also describe the relationship between needle dimension and VC dimension, explore connections between model-based search and active concept learning (including several novel positive results in active learning), and consider a scale-sensitive version of needle dimension. Several simple examples illustrate the dependence of needle dimension on features of search problems.
๐ SIMILAR VOLUMES
This paper analyzes the complexity of heuristic search algorithms, Le. algorithms which find the shortest path in a graph by using an estimate to guide the search. In particular, .algorithm A\*, due to Hart, Nilsson and Raphael, is shown to require 0(2 ~) steps, in the worst cdse, for searching a gr
A climate system is a complex nonlinear system. Estimation of the complexity is of great interest in climatic forecast and prediction. In this paper, we propose the use of sample entropy (SampEn), an entropy-based algorithm, to measure the complexity of daily temperature series. Estimations of SampE
We present a new general upper bound on the number of examples required to estimate all of the expectations of a set of random variables uniformly well. The quality of the estimates is measured using a variant of the relative error proposed by Haussler and Pollard. We also show that our bound is wit
A new methodology is introduced for spatial sampling design when the variable of interest cannot be directly observed, but information on it can be obtained by sampling a related variable, and estimation of the underlying model is required. An approach based on entropy has been proposed by Bueso, An