𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Statistical Language Models for Information Retrieval

✍ Scribed by ChengXiang Zhai


Publisher
Morgan and Claypool Publishers
Year
2008
Tongue
English
Leaves
141
Series
Synthesis Lectures on Human Language Technologies Volume 0
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


As online information grows dramatically, search engines such as Google are playing a more and more important role in our lives. Critical to all search engines is the problem of designing an effective retrieval model that can rank documents accurately for a given query. This has been a central research problem in information retrieval for several decades. In the past ten years, a new generation of retrieval models, often referred to as statistical language models, has been successfully applied to solve many different information retrieval problems. Compared with the traditional models such as the vector space model, these new models have a more sound statistical foundation and can leverage statistical estimation to optimize retrieval parameters. They can also be more easily adapted to model non-traditional and complex retrieval problems. Empirically, they tend to achieve comparable or better performance than a traditional model with less effort on parameter tuning. This book systematically reviews the large body of literature on applying statistical language models to information retrieval with an emphasis on the underlying principles, empirically effective language models, and language models developed for non-traditional retrieval tasks. All the relevant literature has been synthesized to make it easy for a reader to digest the research progress achieved so far and see the frontier of research in this area. The book also offers practitioners an informative introduction to a set of practically useful language models that can effectively solve a variety of retrieval problems. No prior knowledge about information retrieval is required, but some basic knowledge about probability and statistics would be useful for fully digesting all the details. Table of Contents: Introduction / Overview of Information Retrieval Models / Simple Query Likelihood Retrieval Model / Complex Query Likelihood Model / Probabilistic Distance Retrieval Model / Language Models for Special Retrieval Tasks / Language Models for Latent Topic Analysis / Conclusions


πŸ“œ SIMILAR VOLUMES


Statistical Language Models for Informat
✍ ChengXiang Zhai πŸ“‚ Library πŸ“… 2008 πŸ› Morgan and Claypool Publishers 🌐 English

As online information grows dramatically, search engines such as Google are playing a more and more important role in our lives. Critical to all search engines is the problem of designing an effective retrieval model that can rank documents accurately for a given query. This has been a central resea

Statistical Language Models for Informat
✍ Zhai CX. πŸ“‚ Library 🌐 English

Из сСрии Foundations and Trends in Information Retrieval ΠΈΠ·Π΄Π°Ρ‚Π΅Π»ΡŒΡΡ‚Π²Π° NOWPress, 2009, -77 pp.<div class="bb-sep"></div>Statistical language models have recently been successfully applied to many information retrieval problems. A great deal of recent work has shown that statistical language models no

Language Modeling for Information Retrie
✍ John Lafferty, ChengXiang Zhai (auth.), W. Bruce Croft, John Lafferty (eds.) πŸ“‚ Library πŸ“… 2003 πŸ› Springer Netherlands 🌐 English

<p>A statisticallanguage model, or more simply a language model, is a probΒ­ abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthes