Clustering categorical data: an approach based on dynamical systems
Scribed by David Gibson; Jon Kleinberg; Prabhakar Raghavan
- Publisher
- Springer-Verlag
- Year
- 2000
- Language
- English
- File size
- 209 KB
- Volume
- 8
- Category
- Article
- ISSN
- 1066-8888
Synopsis
We describe a novel approach for clustering collections of sets, and its application to the analysis and mining of categorical data. By "categorical data," we mean tables with fields that cannot be naturally ordered by a metric, e.g., the names of producers of automobiles, or the names of products offered by a manufacturer. Our approach is based on an iterative method for assigning and propagating weights on the categorical values in a table; this facilitates a type of similarity measure arising from the co-occurrence of values in the dataset. Our techniques can be studied analytically in terms of certain types of non-linear dynamical systems.
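To make the idea of assigning and propagating weights concrete, here is a minimal Python sketch of one possible iteration. The synopsis does not state the combining rule or the normalization, so both are assumptions here: each value receives the sum of the weights of the values it co-occurs with, and weights are rescaled to unit Euclidean norm within each field. The function name `propagate_weights` and the sample table are hypothetical, and this should not be read as the authors' exact method.

```python
import math
from collections import defaultdict


def propagate_weights(rows, iterations=20):
    """Iteratively propagate weights over categorical values.

    rows: list of equal-length tuples, one categorical value per field.
    Returns a dict mapping (field_index, value) -> weight.
    ASSUMPTIONS: additive combining rule and per-field unit-norm
    rescaling; the synopsis does not specify either choice.
    """
    n_fields = len(rows[0])
    # Key each value by its field so equal strings in different fields stay distinct.
    weights = {(f, row[f]): 1.0 for row in rows for f in range(n_fields)}

    for _ in range(iterations):
        updated = defaultdict(float)
        for row in rows:
            row_nodes = [(f, row[f]) for f in range(n_fields)]
            total = sum(weights[node] for node in row_nodes)
            for node in row_nodes:
                # Each value collects the weights of the other values in its row.
                updated[node] += total - weights[node]

        # Rescale so the weights within each field have unit Euclidean norm.
        norms = defaultdict(float)
        for (f, _value), w in updated.items():
            norms[f] += w * w
        weights = {
            node: (w / math.sqrt(norms[node[0]]) if norms[node[0]] > 0 else 0.0)
            for node, w in updated.items()
        }
    return weights


if __name__ == "__main__":
    # Hypothetical table: (producer, product) pairs.
    table = [
        ("Honda", "Civic"),
        ("Honda", "Accord"),
        ("Toyota", "Civic"),
        ("Ford", "Focus"),
    ]
    for node, weight in sorted(propagate_weights(table).items()):
        print(node, round(weight, 4))
```

Under these assumptions, values that co-occur with many heavily weighted values keep a large weight, while values in weakly connected parts of the table drift toward zero. That separation is the kind of co-occurrence-based similarity signal the synopsis refers to.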
SIMILAR VOLUMES
In this paper, we present a wavelet-based approach which tries to automatically find the number of clusters present in a data set, along with their position and statistical properties. The only information supplied to the method is the data set to analyze and a confidence level parameter. Most of th