Multidimensional fuzzy partitioning of attribute ranges for mining quantitative data
✍ Scribed by Attila Gyenesei; Jukka Teuhola
- Publisher
- John Wiley and Sons
- Year
- 2004
- Language
- English
- File size
- 264 KB
- Volume
- 19
- Category
- Article
- ISSN
- 0884-8173
✦ Synopsis
This article proposes a partitioning algorithm for quantitative attributes that supports the discovery of frequent fuzzy patterns among transactions containing such attributes. More precisely, it presents a heuristic, multivariate, top-down partitioning algorithm that divides attribute ranges into intervals chosen so that the discovered frequent sets are also dense, and thus probably more interesting to the user. The approach is fuzzy: the derived intervals have fuzzy bounds, and so the derived frequent sets are fuzzy as well. The crisp (nonfuzzy) case is obtained as a special case. The quality of the partitioning method is evaluated by measuring the average and absolute information amounts of the obtained fuzzy frequent sets. For the mining task itself, any fuzzy frequent itemset mining method can be used. Experiments show that the algorithm performs multidimensional partitioning in a balanced way and that the "interestingness" of the obtained frequent sets is quite high, especially for correlated attributes.
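To make the idea of intervals with fuzzy bounds more concrete, here is a minimal Python sketch. It is not the authors' algorithm: the synopsis does not give the membership function or the support definition, so the trapezoidal shape, the `overlap` parameter, the use of the minimum to combine attributes, and the names `membership` and `fuzzy_support` are all assumptions made for illustration. The sketch only shows how a record can belong partially to an interval, and how the crisp case falls out when the fuzzy border has zero width.

```python
def membership(x, low, high, overlap=0.0):
    """Degree (0..1) to which x belongs to the interval [low, high].

    Assumption: trapezoidal membership, where each bound is blurred over
    a border of width `overlap` centred on the crisp bound, and
    high - low >= overlap. With overlap == 0 the interval is crisp.
    """
    if overlap <= 0:
        return 1.0 if low <= x <= high else 0.0
    half = overlap / 2
    if x <= low - half or x >= high + half:
        return 0.0                              # clearly outside
    if x < low + half:
        return (x - (low - half)) / overlap     # rising lower border
    if x > high - half:
        return ((high + half) - x) / overlap    # falling upper border
    return 1.0                                  # fully inside


def fuzzy_support(records, itemset, overlap=0.0):
    """Average membership of the records in a multidimensional fuzzy set.

    `itemset` maps an attribute name to its (low, high) interval.
    Assumption: per-attribute memberships are combined with the minimum;
    other combinations (e.g. the product) are equally plausible.
    """
    total = 0.0
    for rec in records:
        total += min(
            membership(rec[attr], lo, hi, overlap)
            for attr, (lo, hi) in itemset.items()
        )
    return total / len(records)
```

With `overlap=0` the function reduces to ordinary (crisp) support counting, which mirrors the synopsis's remark that the crisp case is a special case of the fuzzy one. A partitioning algorithm would then choose the `(low, high)` pairs; that search is not sketched here.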