𝔖 Bobbio Scriptorium
✦   LIBER   ✦

[ACM Press the Third International Workshop - Paris, France (2009.06.28-2009.06.28)] Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data - SensorKDD '09 - OcVFDT

✍ Scribed by Li, Chen; Zhang, Yang; Li, Xue


Book ID
126250180
Publisher
ACM Press
Year
2009
Weight
215 KB
Category
Article
ISBN
1605586684

No coin nor oath required. For personal study only.

✦ Synopsis


Current research on data stream classification mainly focuses on supervised learning, in which a fully labeled data stream is needed for training. However, fully labeled data streams are expensive to obtain, which make the supervised learning approach difficult to be applied to real-life applications. In this paper, we model applications, such as credit fraud detection and intrusion detection, as a one-class data stream classification problem. The cost of fully labeling the data stream is reduced as users only need to provide some positive samples together with the unlabeled samples to the learner. Based on VFDT and POSC4.5, we propose our OcVFDT (One-class Very Fast Decision Tree) algorithm. Experimental study on both synthetic and real-life datasets shows that the OcVFDT has excellent classification performance. Even 80% of the samples in data stream are unlabeled, the classification performance of OcVFDT is still very close to that of VFDT, which is trained on fully labeled stream.


πŸ“œ SIMILAR VOLUMES