𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Clustering with obstacles for Geographical Data Mining

✍ Scribed by Vladimir Estivill-Castro; Ickjai Lee


Publisher
Elsevier Science
Year
2004
Tongue
English
Weight
381 KB
Volume
59
Category
Article
ISSN
0924-2716

No coin nor oath required. For personal study only.

✦ Synopsis


Clustering algorithms typically use the Euclidean distance. However, spatial proximity is dependent on obstacles, caused by related information in other layers of the spatial database. We present a clustering algorithm suitable for large spatial databases with obstacles. The algorithm is free of user-supplied arguments and incorporates global and local variations. The algorithm detects clusters in complex scenarios and successfully supports association analysis between layers. All this occurs within O(n log n+[s + t] log n) expected time, where n is the number of points, s is the number of line segments that determine the obstacles and t is the number of Delaunay edges intersecting the obstacles.


πŸ“œ SIMILAR VOLUMES


Cluster tests for geographical areas wit
✍ Friedrich Gebhardt πŸ“‚ Article πŸ“… 1999 πŸ› Elsevier Science 🌐 English βš– 436 KB

Consider an area subdivided into non-overlapping districts, e.g. a state divided into counties, and assume that some districts are marked for having some distinguishing property. Then the question arises whether the marked districts are distributed randomly or exhibit some spatial clustering. This q

Data mining for text categorization with
✍ Antonio GΓ³mez Skarmeta; Amine Bensaid; Nadia Tazi πŸ“‚ Article πŸ“… 2000 πŸ› John Wiley and Sons 🌐 English βš– 119 KB

In this paper we study the use of a semi-supervised agglomerative hierarchical clustering Ε½ . ssAHC algorithm to text categorization, which consists of assigning text documents to Ε½ . Ε½ . predefined categories. ssAHC is i a clustering algorithm that ii uses a finite design set Ε½ . Ε½ . of labeled dat

POWER DETERMINATION FOR GEOGRAPHICALLY C
✍ SCOTT A. HENDRICKS; JAMES T. WASSELL; JAMES W. COLLINS; SUZANNE L. SEDLAK πŸ“‚ Article πŸ“… 1996 πŸ› John Wiley and Sons 🌐 English βš– 631 KB

Study designs in public health research often require the estimation of intervention effects that have been applied to a cluster of subjects in a common geographic area, rather than randomly assigned to individual subjects, and where the outcome is dichotomous. Statistical methods that account for t

A genetic clustering algorithm for data
✍ Lin Yu Tseng; Shiueng Bien Yang πŸ“‚ Article πŸ“… 2000 πŸ› Elsevier Science 🌐 English βš– 266 KB

In solving clustering problem, traditional methods, for example, the K-means algorithm and its variants, usually ask the user to provide the number of clusters. Unfortunately, the number of clusters in general is unknown to the user. The traditional neighborhood clustering algorithm usually needs th