A mathematical model of retrieval system performance
β Scribed by McCarn, Davis B. ;Lewis, Craig M.
- Publisher
- John Wiley and Sons
- Year
- 1990
- Tongue
- English
- Weight
- 536 KB
- Volume
- 41
- Category
- Article
- ISSN
- 0002-8231
No coin nor oath required. For personal study only.
β¦ Synopsis
There have been a number of major evaluations of the performance of retrieval systems against large full text and surrogate (bibliographic) databases. These evaluations have concentrated on the experimental determination of the Precision Ratio, the fraction of retrieved items that are relevant to an information request, and the Flecall Ratio, the fraction of the total number of relevant items that were actually retrieved. While these measures have met with general acceptance, they have also generated much controversy.
The purpose of this article is to review the results of several of the largest evaluations and to propose a simple model for the performance of such systems that may help explain the relationship between these measures and user behavior.
Recall and Precision Summary
Data
Recall and Precision summary data from several studies are displayed in Table . These will be discussed in more detail in the "Evaluation Studies" section of this article, but it is apparent that recall results are quite different among studies. Various factors may influence study results, including the type and size of database, whether a controlled vocabulary was used, and which of the many database and search variables that might affect recall were studied.
Evaluation Studies
The two "Cranfield" evaluation studies by Cyril Cleverdon in 1962 and 1966 pioneered the area of information systems evaluation. A similar study was done on a large
π SIMILAR VOLUMES
An idealized mathematical model of adaptive-system behavior, using conventional economic decision-theory assumptions, is stated. The data-processing capacity of this model is analyzed, revealing that it would be rapidly overwhelmed by even moderate complexity. To circumvent this difficulty, a practi
## Abstract In the present paper, a new mathematical model describing the physical, chemical and biological phenomena involved in the process of contaminant removal in biofilters is developed. In addition to the contaminant, the key components of the present theoretical model are carbon dioxide and
A unified mathematical definition for the classical (vector and probabilistic) models of information retrieval is given. Also, a mathematical structure (Diophantine set) behind relevance feedback is identified.
## Introduction hypertext-based card catalog without prior instruction and that they searched as effectively when using a Boolean-Hypertext is a relatively new mechanism by which based catalog. bibliographic searching may be performed. Searcher con-Meadow, Wang, and Yuan (1995) examined perforfide