Detecting outlier samples in multivariate time series dataset
โ Scribed by Xiaoqing Weng; Junyi Shen
- Publisher
- Elsevier Science
- Year
- 2008
- Tongue
- English
- Weight
- 260 KB
- Volume
- 21
- Category
- Article
- ISSN
- 0950-7051
No coin nor oath required. For personal study only.
โฆ Synopsis
Multivariate time series (MTS) samples which differ significantly from other MTS samples are referred to as outlier samples. In this paper, an algorithm designed to efficiently detect the top n outlier samples in MTS dataset, based on Solving Set, is proposed. An extended Frobenius Norm is used to compute the distance between MTS samples. The outlier score of MTS sample is the sum of the distances from its k nearest neighbors. The time complexity of the algorithm is subquadratic. We conduct experiments on two real-world datasets, stock market dataset and BCI (Brain Computer Interface) dataset. The experiment results show the efficiency and effectiveness of the algorithm.
๐ SIMILAR VOLUMES
Detection of outlier from high dimensional dataset have found important applications in many fields, yet the unexpected time consumption is likely to hinder its practical use. Thus, it makes sense to build an efficient method for finding meaningful outliers and analyzing their intentional knowledge.
The choice of a matrix square root in order to define a correlation coefficient is crucial for the notion of partial autocorrelation function (PACF) for a multivariate time series. Here this topic is revisited and, introducing a new matrix link coefficient between two random vectors, a general frame
Rohlf (1975, Biometrics 31, 93-101) proposed a method of detecting outliers in multivariate data by testing the largest edge of the minimum spanning tree. It is shown here that tests against the gamma distribution are extremely liberal. Furthermore, results depend on the correlation structure of the
A uniยฎed method to detect and handle innovational and additive outliers, and permanent and transient level changes has been presented by R. S. Tsay. N. S. Balke has found that the presence of level changes may lead to misidentiยฎcation and loss of test-power, and suggests augmenting Tsay's procedure