Let A be a finite set of points in general position in Rd, d 2 2. Points a,, a2, , a, E A form an n-hole in A (an empty convex subset of A) if they are vertices of a convex polytope containing no other point of A. Let h(d) denote the maximum number h such that any sufficiently large set of points in
Mining for empty spaces in large data sets
✍ Scribed by Jeff Edmonds; Jarek Gryz; Dongming Liang; Renée J. Miller
- Publisher
- Elsevier Science
- Year
- 2003
- Tongue
- English
- Weight
- 166 KB
- Volume
- 296
- Category
- Article
- ISSN
- 0304-3975
No coin nor oath required. For personal study only.
✦ Synopsis
Many data mining approaches focus on the discovery of similar (and frequent) data values in large data sets. We present an alternative, but complementary approach in which we search for empty regions in the data. We consider the problem of ÿnding all maximal empty rectangles in large, two-dimensional data sets. We introduce a novel, scalable algorithm for ÿnding all such rectangles. The algorithm achieves this with a single scan over a sorted data set and requires only a small bounded amount of memory. We extend the algorithm to ÿnd all maximal empty hyper-rectangles in a multi-dimensional space. We consider the complexity of this search problem and present new bounds on the number of maximal empty hyper-rectangles. We brie y overview experimental results obtained by applying our algorithm to real and synthetic data sets and describe one application of empty-space knowledge to query optimization.
📜 SIMILAR VOLUMES
Uncertainty management is necessary for real world applications, especially those used with data mining. The Region Connection Calculus (RCC) and egg-yolk methods have proven useful for the representation of vague regions in spatial data. Rough set theory has been shown to be an effective tool for d