For information retrieval in the Chinese language, three representation methods for texts are popular, namely: 1-gram or character, bigram, and short-word. Each has its advantages as well as drawbacks. Employing more than one method may combine advantages from them and enhance retrieval effectivenes
A “stereo” document representation for textual information retrieval
✍ Scribed by Liang Chen; Jia Zeng; Naoyuki Tokuda
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 234 KB
- Volume
- 57
- Category
- Article
- ISSN
- 1532-2882
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
A new document representation model is presented in this paper. This model is based on the idea of representing a document by two or more pictures of the document taken from different perspectives. It is shown that by applying the stereo representation model, enhanced textual retrieval performance is achieved because the new model improves the capability of capturing individual features of the document. Experiments have been conducted on two standard corpora, TIME and ADI, using the standard term vector method and the latent semantic indexing (LSI) method based upon both the stereo representation model and the traditional representation model. Statistical t‐tests on the experimental results have convincingly illustrated that these methods achieve significant improvements in retrieval performances with the stereo representation model over those with the traditional representation model.
📜 SIMILAR VOLUMES
This article investigates the potentialities of using empir-to locate articles that discuss the relationships between ical variables and their associated statistical relationemotional depression and self-esteem. Using the PsychIships in document representation and retrieval. To this nfo database she
## man, 1993). Arabic provides a very different context National Conferences as a source. All these abstracts from English, since it is a non-Indo-European language involve computer science and information systems. We with a complex morphological structure. also designed and built an automatic inf