A theory of proximity based clustering: structure detection by optimization
Scribed by Jan Puzicha; Thomas Hofmann; Joachim M. Buhmann
- Publisher
- Elsevier Science
- Year
- 2000
- Language
- English
- File size
- 822 KB
- Volume
- 33
- Category
- Article
- ISSN
- 0031-3203
Synopsis
In this paper, a systematic optimization approach for clustering proximity or similarity data is developed. Starting from fundamental invariance and robustness properties, a set of axioms is proposed and discussed to distinguish different cluster compactness and separation criteria. The approach covers the case of sparse proximity matrices, and is extended to nested partitionings for hierarchical data clustering. To solve the associated optimization problems, a rigorous mathematical framework for deterministic annealing and mean-field approximation is presented. Efficient optimization heuristics are derived in a canonical way, which also clarifies the relation to stochastic optimization by Gibbs sampling. Similarity-based clustering techniques have a broad range of possible applications in computer vision, pattern recognition, and data analysis. As a major practical application we present a novel approach to the problem of unsupervised texture segmentation, which relies on statistical tests as a measure of homogeneity. The quality of the algorithms is empirically evaluated on a large collection of Brodatz-like micro-texture Mondrians and on a set of real-world images. To demonstrate the broad usefulness of the theory of proximity based clustering, the performances of different criteria and algorithms are compared on an information retrieval task for a document database. The superiority of optimization algorithms for clustering is supported by extensive experiments.
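To make the deterministic annealing idea in the synopsis concrete, the following is a minimal sketch, not the authors' algorithm. It assumes a simplified cost in which an object's cost for a cluster is its average dissimilarity to the objects currently (softly) assigned there, and it applies a softmax update while the temperature is lowered. The function name `anneal_pairwise_clustering` and all parameters (`t_start`, `t_end`, `cooling`, `sweeps`, `seed`) are hypothetical choices made for illustration; the paper's own criteria and mean-field equations are not reproduced here.

```python
import math
import random


def anneal_pairwise_clustering(D, k, t_start=1.0, t_end=0.01,
                               cooling=0.8, sweeps=20, seed=0):
    """Soft-cluster n objects from an n x n dissimilarity matrix D.

    Illustrative sketch only: the cost below is an assumed simplification.
    Returns one hard label per object (argmax of the final soft assignment).
    """
    rng = random.Random(seed)
    n = len(D)
    worst = max(max(row) for row in D)  # fallback cost for an empty cluster

    # Near-uniform soft assignments; small noise breaks the symmetry.
    q = []
    for _ in range(n):
        row = [1.0 + 0.01 * rng.random() for _ in range(k)]
        total = sum(row)
        q.append([w / total for w in row])

    t = t_start
    while t > t_end:
        for _ in range(sweeps):
            for i in range(n):
                # Assumed cost of placing object i in cluster v: the
                # q-weighted average dissimilarity to the other objects in v.
                costs = []
                for v in range(k):
                    size = sum(q[j][v] for j in range(n) if j != i)
                    if size < 1e-9:
                        costs.append(worst)
                    else:
                        pull = sum(q[j][v] * D[i][j]
                                   for j in range(n) if j != i)
                        costs.append(pull / size)
                # Softmax update at temperature t; subtracting the minimum
                # cost keeps the exponentials from overflowing.
                low = min(costs)
                weights = [math.exp(-(c - low) / t) for c in costs]
                total = sum(weights)
                q[i] = [w / total for w in weights]
        t *= cooling  # lower the temperature gradually

    return [max(range(k), key=lambda v: q[i][v]) for i in range(n)]
```

The function takes only the dissimilarity matrix, never the objects' coordinates, which is the setting the synopsis describes. At high temperature the assignments stay soft, and as the temperature drops they harden toward a partition. The starting temperature would likely need to be chosen relative to the scale of the dissimilarities.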
SIMILAR VOLUMES
## Abstract A powerful combination of molecular beacon and luminescence resonance energy transfer technology reveals alterations in nucleic acid structure by as little as a single nucleotide in a novel hybridization proximity assay. The assay measures the length of a single-stranded target when a t