SAND: Semi-Supervised Adaptive Novel Class Detection and Classification over Data Stream

Authors

  • Ahsanul Haque The University of Texas at Dallas
  • Latifur Khan The University of Texas at Dallas
  • Michael Baron The University of Texas at Dallas

DOI:

https://doi.org/10.1609/aaai.v30i1.10283

Keywords:

Dynamic Chunk Size, Classifier Confidence, Limited Labeled Data, Concept Drift, Concept Evolution

Abstract

Most approaches to classifying data streams either divide the stream into fixed-size chunks or use gradual forgetting. Due to evolving nature of data streams, finding a proper size or choosing a forgetting rate without prior knowledge about time-scale of change is not a trivial task. These approaches hence suffer from a trade-off between performance and sensitivity. Existing dynamic sliding window based approaches address this problem by tracking changes in classifier error rate, but are supervised in nature. We propose an efficient semi-supervised framework in this paper which uses change detection on classifier confidence to detect concept drifts, and to determine chunk boundaries dynamically. It also addresses concept evolution problem by detecting outliers having strong cohesion among themselves. Experiment results on benchmark and synthetic data sets show effectiveness of the proposed approach.

Downloads

Published

2016-02-21

How to Cite

Haque, A., Khan, L., & Baron, M. (2016). SAND: Semi-Supervised Adaptive Novel Class Detection and Classification over Data Stream. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10283

Issue

Section

Technical Papers: Machine Learning Methods