๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Documents and queries as random variables: History and implications

โœ Scribed by David Bodoff; Samuel Po-Shing Wong


Publisher
John Wiley and Sons
Year
2006
Tongue
English
Weight
258 KB
Volume
57
Category
Article
ISSN
1532-2882

No coin nor oath required. For personal study only.

โœฆ Synopsis


Abstract

The view of documents and/or queries as random variables is gaining importance in the theory of information retrieval. We argue that traditional probabilistic models consider documents and queries as random variables, but that newer models such as language modeling and our unified model take this one step further. The additional step is called error in predictors. Such models consider that we don't observe the document and query random variables that are modeled to predict relevance probabilistically. Rather, there are additional random variables, which are the observed documents and queries. We discuss some important implications of this idea for parameter estimation, relevance prediction, and even testโ€collection construction. By clarifying the positions of various probabilistic models on this question, and presenting in one place many of its implications, this article aims to deepen our common understanding of the theories behind traditional probabilistic models, and to strengthen the theoretical basis for further development of more recent approaches such as language modeling.


๐Ÿ“œ SIMILAR VOLUMES


Environmental Variability and Semelparit
โœ ESA RANTA; DAVID TESAR; VEIJO KAITALA ๐Ÿ“‚ Article ๐Ÿ“… 2002 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 196 KB

Research on the evolution of life histories addresses the topic of fitness trade-offs between semelparity (reproducing once in a lifetime) and iteroparity (repeated reproductive bouts per lifetime). Bulmer (1994) derived the relationship v+P(A)<1 (P(A) is the adult survival;vb(S) and b(S) are the of

Extreme flow variability and the โ€˜boom a
โœ Angela H. Arthington; Stephen R. Balcombe ๐Ÿ“‚ Article ๐Ÿ“… 2011 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 300 KB ๐Ÿ‘ 1 views

## Abstract Floodplain rivers in arid and semiโ€arid regions may be the most threatened of all river systems because water resource developments typically dampen their most distinctive characteristicsโ€”extreme flow variability and โ€˜boom and bustโ€™ ecological dynamics. This article shows how one of the