Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Data

Authors

  • Heejo Kong Dept. of Brain and Cognitive Engineering, Korea University
  • Suneung Kim Dept. of Artificial Intelligence, Korea University
  • Ho-Joong Kim Dept. of Artificial Intelligence, Korea University
  • Seong-Whan Lee Dept. of Artificial Intelligence, Korea University

DOI:

https://doi.org/10.1609/aaai.v38i12.29227

Keywords:

ML: Semi-Supervised Learning, ML: Applications, ML: Deep Learning Algorithms

Abstract

Recent advances in semi-supervised learning (SSL) have relied on the optimistic assumption that labeled and unlabeled data share the same class distribution. However, this assumption is often violated in real-world scenarios, where unlabeled data may contain out-of-class samples. SSL with such uncurated unlabeled data leads training models to be corrupted. In this paper, we propose a robust SSL method for learning from uncurated real-world data within the context of open-set semi-supervised learning (OSSL). Unlike previous works that rely on feature similarity distance, our method exploits uncertainty in logits. By leveraging task-dependent predictions of logits, our method is capable of robust learning even in the presence of highly correlated outliers. Our key contribution is to present an unknown-aware graph regularization (UAG), a novel technique that enhances the performance of uncertainty-based OSSL frameworks. The technique addresses not only the conflict between training objectives for inliers and outliers but also the limitation of applying the same training rule for all outlier classes, which are existed on previous uncertainty-based approaches. Extensive experiments demonstrate that UAG surpasses state-of-the-art OSSL methods by a large margin across various protocols. Codes are available at https://github.com/heejokong/UAGreg.

Published

2024-03-24

How to Cite

Kong, H., Kim, S., Kim, H.-J., & Lee, S.-W. (2024). Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Data. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13265–13273. https://doi.org/10.1609/aaai.v38i12.29227

Issue

Section

AAAI Technical Track on Machine Learning III