𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Information Retrieval Models: Foundations and Relationships

✍ Scribed by Thomas Roelleke


Publisher
Morgan & Claypool Publishers
Year
2013
Tongue
English
Leaves
163
Series
Synthesis Lectures on Information Concepts, Retrieval, and Services
Edition
1st
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR).

Regarding intuition and simplicity, though LM is clear from a probabilistic point of view, several people stated: "It is easy to understand TF-IDF and BM25. For LM, however, we understand the math, but we do not fully understand why it works."

This book takes a horizontal approach gathering the foundations of TF-IDF, PRF, BIR, Poisson, BM25, LM, probabilistic inference networks (PIN's), and divergence-based models. The aim is to create a consolidated and balanced view on the main models.

A particular focus of this book is on the "relationships between models." This includes an overview over the main frameworks (PRF, logical IR, VSM, generalized VSM) and a pairing of TF-IDF with other models. It becomes evident that TF-IDF and LM measure the same, namely the dependence (overlap) between document and query. The Poisson probability helps to establish probabilistic, non-heuristic roots for TF-IDF, and the Poisson parameter, average term frequency, is a binding link between several retrieval models and model parameters.

Table of Contents: List of Figures / Preface / Acknowledgments / Introduction / Foundations of IR Models / Relationships Between IR Models / Summary & Research Outlook / Bibliography / Author's Biography / Index


πŸ“œ SIMILAR VOLUMES


Music Retrieval (Foundations and Trends
✍ Nicola Orio πŸ“‚ Library πŸ“… 2006 🌐 English

Music Accessing and Retrieval is the first comprehensive survey of the vast new field of Music Information Retrieval (MIR). It describes a number of issues which are peculiar to the language of music - including forms, formats, and dimensions of music - together with the typologies of users and thei

Music Retrieval (Foundations and Trends
✍ Nicola Orio πŸ“‚ Library πŸ“… 2006 🌐 English

Music Accessing and Retrieval is the first comprehensive survey of the vast new field of Music Information Retrieval (MIR). It describes a number of issues which are peculiar to the language of music - including forms, formats, and dimensions of music - together with the typologies of users and thei

Credibility in Information Retrieval (Fo
✍ Alexandru L. Ginsca, Adrian Popescu, Mihai Lupu πŸ“‚ Library πŸ“… 2015 πŸ› Now Publishers Inc 🌐 English

Credibility in Information Retrieval presents a detailed analysis of existing credibility models from different information seeking research areas, with a focus on the Web and its pervasive social component. It shows that there is a very rich body of work pertaining to different aspects and interpre

Mathematical Foundations of Information
✍ SΓ‘ndor Dominich (auth.) πŸ“‚ Library πŸ“… 2001 πŸ› Springer Netherlands 🌐 English

This book offers a comprehensive and consistent mathematical approach to information retrieval (IR) without which no implementation is possible, and sheds an entirely new light upon the structure of IR models. It contains the descriptions of all IR models in a unified formal style and language, alon

Information Retrieval: Uncertainty and L
✍ Cornelis Joost van Rijsbergen (auth.), Fabio Crestani, Mounia Lalmas, Cornelis J πŸ“‚ Library πŸ“… 1998 πŸ› Springer US 🌐 English

<p>In recent years, there have been several attempts to define a logic for information retrieval (IR). The aim was to provide a rich and uniform representation of information and its semantics with the goal of improving retrieval effectiveness. The basis of a logical model for IR is the assumption t