Topological aspects of information retrieval
β Scribed by Egghe, Leo ;Rousseau, Ronald
- Publisher
- John Wiley and Sons
- Year
- 1998
- Tongue
- English
- Weight
- 164 KB
- Volume
- 49
- Category
- Article
- ISSN
- 0002-8231
No coin nor oath required. For personal study only.
β¦ Synopsis
Let (DS, DQ, sim) be a retrieval system consisting of a document space DS, a query space QS, and a function sim, expressing the similarity between a document and a query. Following D. M. Everett and S. C. Cater (1992), we introduce topologies on the document space. These topologies are generated by the similarity function sim and the query space QS. Three topologies will be studied: The retrieval topology, the similarity topology, and the (pseudo-)metric one. It is shown that the retrieval topology is the coarsest of the three, while the (pseudo-) metric is the strongest. These three topologies are generally different, reflecting distinct topological aspects of information retrieval. We present necessary and sufficient conditions for these topological aspects to be equal. Several examples of topological retrieval systems are presented. One of these examples is a vector space model that yields a simplification of the Everett-Cater model, yet having a more diversified spectrum of topological properties. Finally, it is shown that information retrieval based on Boolean operators is an intrinsic part of the general topological model. This is a major motivation of the introduction of topologies in theoretical IR models.
π SIMILAR VOLUMES
## Introduction As is well known, every living organism has needs, the satisfaction of which is necessary for maintaining the organism's life and for its development. This applies especially to human beings. Any conscious activity in the last analysis is directed towards the satisfaction of needs
## Abstract We investigate connections between the syntactic and semantic distance of programs on an abstract, recursion theoretic level. For a certain rather restrictive notion of interdependency of the two kinds of distances, there remain only few and βunnaturalβ numberings allowing such close re
Sum ma ry ## Site-spec@ recombination events are of fundamental importance in many biological systems. In vitro experiments using purified proteins and D N A substrates are yielding insights into strand exchange mechanisms and synapsis of recombination sites. By examining results across a range o
The topological aspects of the conformational transformations in proteins are investigated using a new peptide-ribbon representation of the tertiary structure. The topological parameters evaluated on a set of 49 proteins show striking regularities that extend beyond the secondary structures actually