On clustering massive text and categorical data streams
β Scribed by Charu C. Aggarwal; Philip S. Yu
- Publisher
- Springer-Verlag
- Year
- 2009
- Tongue
- English
- Weight
- 785 KB
- Volume
- 24
- Category
- Article
- ISSN
- 0219-1377
No coin nor oath required. For personal study only.
π SIMILAR VOLUMES
In this paper we study the use of a semi-supervised agglomerative hierarchical clustering Ε½ . ssAHC algorithm to text categorization, which consists of assigning text documents to Ε½ . Ε½ . predefined categories. ssAHC is i a clustering algorithm that ii uses a finite design set Ε½ . Ε½ . of labeled dat
We describe a novel approach for clustering collections of sets, and its application to the analysis and mining of categorical data. By "categorical data," we mean tables with fields that cannot be naturally ordered by a metrice.g., the names of producers of automobiles, or the names of products off