## Abstract Information retrieval systems' ability to retrieve highly relevant documents has become more and more important in the age of extremely large collections, such as the World Wide Web (WWW). The authors' aim was to find out how corpus‐based cross‐language information retrieval (CLIR) mana
Categorization-driven cross-language retrieval of medical information
✍ Scribed by Hermes R. Freitas-Junior; Berthier Ribeiro-Neto; Rodrigo F. Vale; Alberto H. F. Laender; Luciano R. S. Lima
- Book ID
- 101654484
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 154 KB
- Volume
- 57
- Category
- Article
- ISSN
- 1532-2882
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
The Web has become a large repository of documents (or pages) written in many different languages. In this context, traditional information retrieval (IR) techniques cannot be used whenever the user query and the documents being retrieved are in different languages. To address this problem, new cross‐language information retrieval (CLIR) techniques have been proposed. In this work, we describe a method for cross‐language retrieval of medical information. This method combines query terms and related medical concepts obtained automatically through a categorization procedure. The medical concepts are used to create a linguistic abstraction that allows retrieval of information in a language‐independent way, minimizing linguistic problems such as polysemy. To evaluate our method, we carried out experiments using the OHSUMED test collection, whose documents are written in English, with queries expressed in Portuguese, Spanish, and French. The results indicate that our cross‐language retrieval method is as effective as a standard vector space model algorithm operating on queries and documents in the same language. Further, our results are better than previous results in the literature.
📜 SIMILAR VOLUMES
## Abstract This paper proposes a Retrieval Outcome Analysis Framework, or ROA Framework, to systematically evaluate retrieval performance of Cross‐Language Information Retrieval systems. The ROA framework goes beyond TREC‐type retrieval evaluation methodology by including procedures focusing on in