The Monte Carlo method and the evaluation of retrieval system performance
β Scribed by Burgin, Robert
- Publisher
- John Wiley and Sons
- Year
- 1999
- Tongue
- English
- Weight
- 82 KB
- Volume
- 50
- Category
- Article
- ISSN
- 0002-8231
No coin nor oath required. For personal study only.
β¦ Synopsis
The ability to distinguish between acceptable and unacceptable levels of retrieval performance and the ability to distinguish between significant and non-significant differences between retrieval results are important to traditional information retrieval experiments. The Monte Carlo method is shown to represent an attractive alternative to the hypergeometric model for identifying the levels at which random retrieval performance is exceeded in retrieval test collections and for overcoming some of the limitations of the hypergeometric model. The Monte Carlo method produces low performance thresholds for the individual test collections that are very similar to the thresholds derived by the hypergeometric model, both at the test collection level and at the individual query level. In addition, the Monte Carlo method is much less computer-intensive than the hypergeometric model, can be used with measures of retrieval effectiveness that take the rank order of the retrieved documents into consideration, can be used to derive the probability of obtained results, and can be used to determine the statistical significance of difference between two or more retrieval results. The ability to use the Monte Carlo method to derive the probability of obtained results and to compare two or more retrieval results makes it possible to determine more accurately how well retrieval systems operate under specific conditions and, in conjunction with the presentation of individual query results, makes it possible to determine whether relationships between query characteristics and retrieval system performance exist. Understanding these relationships should lead to improvements in the effectiveness of retrieval systems.
π SIMILAR VOLUMES
The Laplace transform is applied to remove the time-dependent variable in the di usion equation. For nonharmonic initial conditions this gives rise to a non-homogeneous modiΓΏed Helmholtz equation which we solve by the method of fundamental solutions. To do this a particular solution must be obtained
Experimental kineticists are always faced with the problem of reducing kinetic data to extract physically meaningful information. A particularly vexing problem arises when different models reproduce the data but yield different values for the physical parameters. For over forty-five years Monte Carl
In this paper we investigate the multi-period forecast performance of a number of empirical self-exciting threshold autoregressive (SETAR) models that have been proposed in the literature for modelling exchange rates and GNP, among other variables. We take each of the empirical SETAR models in turn
Three inferential morphometric methods, Euclidean distance matrix analysis (EDMA), Bookstein's edge-matching method (EMM), and the Procrustes method, were applied to facial landmark data. A Monte Carlo simulation was conducted with three sample sizes, ranging from n = 10 to 50, to assess type I erro