๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Detecting outlier samples in multivariate time series dataset

โœ Scribed by Xiaoqing Weng; Junyi Shen


Publisher
Elsevier Science
Year
2008
Tongue
English
Weight
260 KB
Volume
21
Category
Article
ISSN
0950-7051

No coin nor oath required. For personal study only.

โœฆ Synopsis


Multivariate time series (MTS) samples which differ significantly from other MTS samples are referred to as outlier samples. In this paper, an algorithm designed to efficiently detect the top n outlier samples in MTS dataset, based on Solving Set, is proposed. An extended Frobenius Norm is used to compute the distance between MTS samples. The outlier score of MTS sample is the sum of the distances from its k nearest neighbors. The time complexity of the algorithm is subquadratic. We conduct experiments on two real-world datasets, stock market dataset and BCI (Brain Computer Interface) dataset. The experiment results show the efficiency and effectiveness of the algorithm.


๐Ÿ“œ SIMILAR VOLUMES


Finding key attribute subset in dataset
โœ Peng Yang; Qingsheng Zhu ๐Ÿ“‚ Article ๐Ÿ“… 2011 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 270 KB

Detection of outlier from high dimensional dataset have found important applications in many fields, yet the unexpected time consumption is likely to hinder its practical use. Thus, it makes sense to build an efficient method for finding meaningful outliers and analyzing their intentional knowledge.

Sample Partial Autocorrelation Function
โœ S. Degerine ๐Ÿ“‚ Article ๐Ÿ“… 1994 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 743 KB

The choice of a matrix square root in order to define a correlation coefficient is crucial for the notion of partial autocorrelation function (PACF) for a multivariate time series. Here this topic is revisited and, introducing a new matrix link coefficient between two random vectors, a general frame

On Rohlfโ€ฒs Method for the Detection of O
โœ C. Caroni; P. Prescott ๐Ÿ“‚ Article ๐Ÿ“… 1995 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 510 KB

Rohlf (1975, Biometrics 31, 93-101) proposed a method of detecting outliers in multivariate data by testing the largest edge of the minimum spanning tree. It is shown here that tests against the gamma distribution are extremely liberal. Furthermore, results depend on the correlation structure of the

Detection of outliers and level shifts i
โœ Kjell Vaage ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 180 KB

A uniยฎed method to detect and handle innovational and additive outliers, and permanent and transient level changes has been presented by R. S. Tsay. N. S. Balke has found that the presence of level changes may lead to misidentiยฎcation and loss of test-power, and suggests augmenting Tsay's procedure