Music Accessing and Retrieval is the first comprehensive survey of the vast new field of Music Information Retrieval (MIR). It describes a number of issues which are peculiar to the language of music - including forms, formats, and dimensions of music - together with the typologies of users and thei
Information Retrieval Models: Foundations and Relationships
β Scribed by Thomas Roelleke
- Publisher
- Morgan & Claypool Publishers
- Year
- 2013
- Tongue
- English
- Leaves
- 163
- Series
- Synthesis Lectures on Information Concepts, Retrieval, and Services
- Edition
- 1st
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR).
Regarding intuition and simplicity, though LM is clear from a probabilistic point of view, several people stated: "It is easy to understand TF-IDF and BM25. For LM, however, we understand the math, but we do not fully understand why it works."
This book takes a horizontal approach gathering the foundations of TF-IDF, PRF, BIR, Poisson, BM25, LM, probabilistic inference networks (PIN's), and divergence-based models. The aim is to create a consolidated and balanced view on the main models.
A particular focus of this book is on the "relationships between models." This includes an overview over the main frameworks (PRF, logical IR, VSM, generalized VSM) and a pairing of TF-IDF with other models. It becomes evident that TF-IDF and LM measure the same, namely the dependence (overlap) between document and query. The Poisson probability helps to establish probabilistic, non-heuristic roots for TF-IDF, and the Poisson parameter, average term frequency, is a binding link between several retrieval models and model parameters.
Table of Contents: List of Figures / Preface / Acknowledgments / Introduction / Foundations of IR Models / Relationships Between IR Models / Summary & Research Outlook / Bibliography / Author's Biography / Index
π SIMILAR VOLUMES
Music Accessing and Retrieval is the first comprehensive survey of the vast new field of Music Information Retrieval (MIR). It describes a number of issues which are peculiar to the language of music - including forms, formats, and dimensions of music - together with the typologies of users and thei
Credibility in Information Retrieval presents a detailed analysis of existing credibility models from different information seeking research areas, with a focus on the Web and its pervasive social component. It shows that there is a very rich body of work pertaining to different aspects and interpre
This book offers a comprehensive and consistent mathematical approach to information retrieval (IR) without which no implementation is possible, and sheds an entirely new light upon the structure of IR models. It contains the descriptions of all IR models in a unified formal style and language, alon
<p>In recent years, there have been several attempts to define a logic for information retrieval (IR). The aim was to provide a rich and uniform representation of information and its semantics with the goal of improving retrieval effectiveness. The basis of a logical model for IR is the assumption t