✦ LIBER ✦

Outlier detection for high dimensional data

✍ Scribed by Aggarwal, Charu C.; Yu, Philip S.

Book ID: 124155560
Publisher: Association for Computing Machinery
Year: 2001
Tongue: English
Weight: 193 KB
Volume: 30
Category: Article
ISSN: 0163-5808
DOI: 10.1145/376284.375668

No coin nor oath required. For personal study only.

✦ Synopsis

The outlier detection problem has important applications in the field of fraud detection, network robustness analysis, and intrusion detection. Most such applications are high dimensional domains in which the data can contain hundreds of dimensions. Many recent algorithms use concepts of proximity in order to find outliers based on their relationship to the rest of the data. However, in high dimensional space, the data is sparse and the notion of proximity fails to retain its meaningfulness. In fact, the sparsity of high dimensional data implies that every point is an almost equally good outlier from the perspective of proximity-based definitions. Consequently, for high dimensional data, the notion of finding meaningful outliers becomes substantially more complex and non-obvious. In this paper, we discuss new techniques for outlier detection which find the outliers by studying the behavior of projections from the data set.

📜 SIMILAR VOLUMES

[ACM Press the 2001 ACM SIGMOD internati

[ACM Press the 2001 ACM SIGMOD international conference - Santa Barbara, California, United States (2001.05.21-2001.05.24)] Proceedings of the 2001 ACM SIGMOD international conference on Management of data - SIGMOD '01 - Outlier detection for high dimensional data

✍ Aggarwal, Charu C.; Yu, Philip S. 📂 Article 📅 2001 🏛 ACM Press 🌐 English ⚖ 193 KB

A kernel-based approach for detecting ou

A kernel-based approach for detecting outliers of high-dimensional biological data

✍ Jung Hun Oh; Jean Gao 📂 Article 📅 2009 🏛 BioMed Central 🌐 English ⚖ 372 KB

Non-derivable itemsets for fast outlier

Non-derivable itemsets for fast outlier detection in large high-dimensional categorical data

✍ Anna Koufakou; Jimmy Secretan; Michael Georgiopoulos 📂 Article 📅 2010 🏛 Springer-Verlag 🌐 English ⚖ 621 KB

[IEEE 2008 IEEE 24th International Confe

[IEEE 2008 IEEE 24th International Conference on Data Engineering (ICDE 2008) - Cancun, Mexico (2008.04.7-2008.04.12)] 2008 IEEE 24th International Conference on Data Engineering - SPOT: A System for Detecting Projected Outliers From High-dimensional Data Streams

✍ Zhang, Ji; Gao, Qigang; Wang, Hai 📂 Article 📅 2008 🏛 IEEE ⚖ 667 KB

[ACM Press the 18th ACM SIGKDD internati

[ACM Press the 18th ACM SIGKDD international conference - Beijing, China (2012.08.12-2012.08.16)] Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12 - A near-linear time approximation algorithm for angle-based outlier detection in high-dimensional data

✍ Pham, Ninh; Pagh, Rasmus 📂 Article 📅 2012 🏛 ACM Press 🌐 English ⚖ 680 KB

Outlier Analysis || High-Dimensional Out

Outlier Analysis || High-Dimensional Outlier Detection: The Subspace Method

✍ Aggarwal, Charu C. 📂 Article 📅 2012 🏛 Springer New York ⚖ 660 KB