Testing the cluster hypothesis in distributed information retrieval
Scribed by Fabio Crestani; Shengli Wu
- Book ID
- 113663584
- Publisher
- Elsevier Science
- Year
- 2006
- Language
- English
- File size
- 170 KB
- Volume
- 42
- Category
- Article
- ISSN
- 0306-4573
SIMILAR VOLUMES
Abstract: Information retrieval (IR) may be considered an instance of a common modern statistical problem: a massive simultaneous hypothesis test. Such problems arise often in biostatistics where plentiful data must be winnowed to name a small number of potentially "interesting" cases. For instan
This paper presents a cluster algorithm that defines the number of clusters and allows classification of data points. The basic task of the algorithm is to identify accumulations of vectors in the analysis sample of vectors. The accumulations of vectors are determined by testing the statistical hypo