Online Learning in Variable Feature Spaces under Incomplete Supervision

Authors

  • Yi He University of Louisiana at Lafayette
  • Xu Yuan University of Louisiana at Lafayette
  • Sheng Chen University of Louisiana at Lafayette
  • Xindong Wu HeFei University of Technology

Keywords:

Data Stream Mining

Abstract

This paper explores a new online learning problem where the input sequence lives in an over-time varying feature space and the ground-truth label of any input point is given only occasionally, making online learners less restrictive and more applicable. The crux in this setting lies in how to exploit the very limited labels to efficiently update the online learners. Plausible ideas such as propagating labels from labeled points to their neighbors through uncovering the point-wise geometric relations face two challenges: (1) distance measurement fails to work as different points may be described by disparate sets of features and (2) storing the geometric shape, which is formed by all arrived points, is unrealistic in an online setting. To address these challenges, we first construct a universal feature space that accumulates all observed features, making distance measurement feasible. Then, we use manifolds to represent the geometric shapes and approximate them in a sparse means, making manifolds computational and memory tractable in online learning. We frame these two building blocks into a regularized risk minimization algorithm. Theoretical analysis and empirical evidence substantiate the viability and effectiveness of our proposal.

Downloads

Published

2021-05-18

How to Cite

He, Y., Yuan, X., Chen, S., & Wu, X. (2021). Online Learning in Variable Feature Spaces under Incomplete Supervision. Proceedings of the AAAI Conference on Artificial Intelligence, 35(5), 4106-4114. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16532

Issue

Section

AAAI Technical Track on Data Mining and Knowledge Management