𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Interactive clustering and merging with a new fuzzy expected value

✍ Scribed by Carl G. Looney


Publisher
Elsevier Science
Year
2002
Tongue
English
Weight
537 KB
Volume
35
Category
Article
ISSN
0031-3203

No coin nor oath required. For personal study only.

✦ Synopsis


Major problems exist in both crisp and fuzzy clustering algorithms. The fuzzy c-means type of algorithms use weights determined by a power m of inverse distances that remains ÿxed over all iterations and over all clusters, even though smaller clusters should have a larger m. Our method uses a di erent "distance" for each cluster that changes over the early iterations to ÿt the clusters. Comparisons show improved results. We also address other perplexing problems in clustering: (i) ÿnd the optimal number K of clusters; (ii) assess the validity of a given clustering; (iii) prevent the selection of seed vectors as initial prototypes from a ecting the clustering; (iv) prevent the order of merging from a ecting the clustering; and (v) permit the clusters to form more natural shapes rather than forcing them into normed balls of the distance function. We employ a relatively large number K of uniformly randomly distributed seeds and then thin them to leave fewer uniformly distributed seeds. Next, the main loop iterates by assigning the feature vectors and computing new fuzzy prototypes. Our fuzzy merging then merges any clusters that are too close to each other. We use a modiÿed Xie-Bene validity measure as the goodness of clustering measure for multiple values of K in a user-interaction approach where the user selects two parameters (for eliminating clusters and merging clusters after viewing the results thus far). The algorithm is compared with the fuzzy c-means on the iris data and on the Wisconsin breast cancer data.