Cross-Language Information Retrieval
β Scribed by Gregory Grefenstette (auth.), Gregory Grefenstette (eds.)
- Publisher
- Springer US
- Year
- 1998
- Tongue
- English
- Leaves
- 189
- Series
- The Springer International Series on Information Retrieval 2
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Most of the papers in this volume were first presented at the Workshop on Cross-Linguistic Information Retrieval that was held August 22, 1996 durΒ ing the SIGIR'96 Conference. Alan Smeaton of Dublin University and Paraic Sheridan of the ETH, Zurich, were the two other members of the Scientific Committee for this workshop. SIGIR is the Association for Computing MaΒ chinery (ACM) Special Interest Group on Information Retrieval, and they have held conferences yearly since 1977. Three additional papers have been added: Chapter 4 Distributed Cross-Lingual Information retrieval describes the EMIR retrieval system, one of the first general cross-language systems to be implemented and evaluated; Chapter 6 Mapping Vocabularies Using Latent Semantic Indexing, which originally appeared as a technical report in the LabΒ oratory for Computational Linguistics at Carnegie Mellon University in 1991, is included here because it was one of the earliest, though hard-to-find, publiΒ cations showing the application of Latent Semantic Indexing to the problem of cross-language retrieval; and Chapter 10 A Weighted Boolean Model for CrossΒ Language Text Retrieval describes a recent approach to solving the translation term weighting problem, specific to Cross-Language Information Retrieval. Gregory Grefenstette CONTRIBUTORS Lisa Ballesteros David Hull W, Bruce Croft Gregory Grefenstette Center for Intelligent Xerox Research Centre Europe Information Retrieval Grenoble Laboratory Computer Science Department University of Massachusetts Thomas K. Landauer Department of Psychology Mark W. Davis and Institute of Cognitive Science Computing Research Lab University of Colorado, Boulder New Mexico State University Michael L. Littman Bonnie J.
β¦ Table of Contents
Front Matter....Pages i-xi
The Problem of Cross-Language Information Retrieval....Pages 1-9
On The Effective Use of Large Parallel Corpora in Cross-Language Text Retrieval....Pages 11-22
Statistical Methods for Cross-Language Information Retrieval....Pages 23-40
Distributed Cross-Lingual Information Retrieval....Pages 41-50
Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing....Pages 51-62
Mapping Vocabularies Using Latent Semantics....Pages 63-80
Cross-Language Information Retrieval: A System for Comparable Corpus Querying....Pages 81-92
A Language Conversion Front-End for Cross-Language Information Retrieval....Pages 93-104
The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval....Pages 105-118
A Weighted Boolean Model for Cross-Language Text Retrieval....Pages 119-136
Building a Large Multilingual Test Collection from Comparable News Documents....Pages 137-150
Evaluating Cross-Language Text Filtering Effectiveness....Pages 151-161
Back Matter....Pages 163-182
β¦ Subjects
Information Storage and Retrieval; Language Translation and Linguistics; Artificial Intelligence (incl. Robotics); Data Structures, Cryptology and Information Theory
π SIMILAR VOLUMES
Search for information is no longer exclusively limited within the native language of the user, but is more and more extended to other languages. This gives rise to the problem of cross-language information retrieval (CLIR), whose goal is to find relevant information written in a different langua
<p>The last decade has been one of dramatic progress in the field of Natural Language Processing (NLP). This hitherto largely academic discipline has found itself at the center of an information revolution ushered in by the Internet age, as demand for human-computer communication and informaΒ tion a
<p>The last decade has been one of dramatic progress in the field of Natural Language Processing (NLP). This hitherto largely academic discipline has found itself at the center of an information revolution ushered in by the Internet age, as demand for human-computer communication and informaΒ tion a
<p>A statisticallanguage model, or more simply a language model, is a probΒ abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthes
As online information grows dramatically, search engines such as Google are playing a more and more important role in our lives. Critical to all search engines is the problem of designing an effective retrieval model that can rank documents accurately for a given query. This has been a central resea