Consider an area subdivided into non-overlapping districts, e.g. a state divided into counties, and assume that some districts are marked for having some distinguishing property. Then the question arises whether the marked districts are distributed randomly or exhibit some spatial clustering. This q
Clustering with obstacles for Geographical Data Mining
β Scribed by Vladimir Estivill-Castro; Ickjai Lee
- Publisher
- Elsevier Science
- Year
- 2004
- Tongue
- English
- Weight
- 381 KB
- Volume
- 59
- Category
- Article
- ISSN
- 0924-2716
No coin nor oath required. For personal study only.
β¦ Synopsis
Clustering algorithms typically use the Euclidean distance. However, spatial proximity is dependent on obstacles, caused by related information in other layers of the spatial database. We present a clustering algorithm suitable for large spatial databases with obstacles. The algorithm is free of user-supplied arguments and incorporates global and local variations. The algorithm detects clusters in complex scenarios and successfully supports association analysis between layers. All this occurs within O(n log n+[s + t] log n) expected time, where n is the number of points, s is the number of line segments that determine the obstacles and t is the number of Delaunay edges intersecting the obstacles.
π SIMILAR VOLUMES
In this paper we study the use of a semi-supervised agglomerative hierarchical clustering Ε½ . ssAHC algorithm to text categorization, which consists of assigning text documents to Ε½ . Ε½ . predefined categories. ssAHC is i a clustering algorithm that ii uses a finite design set Ε½ . Ε½ . of labeled dat
Study designs in public health research often require the estimation of intervention effects that have been applied to a cluster of subjects in a common geographic area, rather than randomly assigned to individual subjects, and where the outcome is dichotomous. Statistical methods that account for t
In solving clustering problem, traditional methods, for example, the K-means algorithm and its variants, usually ask the user to provide the number of clusters. Unfortunately, the number of clusters in general is unknown to the user. The traditional neighborhood clustering algorithm usually needs th