Semi-supervised Learning with Support Isolation by Small-Paced Self-Training

Authors

  • Zheng Xie Nanjing University
  • Hui Sun Nanjing University
  • Ming Li Nanjing University

DOI:

https://doi.org/10.1609/aaai.v37i9.26249

Keywords:

ML: Semi-Supervised Learning

Abstract

In this paper, we address a special scenario of semi-supervised learning, where the label missing is caused by a preceding filtering mechanism, i.e., an instance can enter a subsequent process in which its label is revealed if and only if it passes the filtering mechanism. The rejected instances are prohibited to enter the subsequent labeling process due to economical or ethical reasons, making the support of the labeled and unlabeled distributions isolated from each other. In this case, semi-supervised learning approaches which rely on certain coherence of the labeled and unlabeled distribution would suffer from the consequent distribution mismatch, and hence result in poor prediction performance. In this paper, we propose a Small-Paced Self-Training framework, which iteratively discovers labeled and unlabeled instance subspaces with bounded Wasserstein distance. We theoretically prove that such a framework may achieve provably low error on the pseudo labels during learning. Experiments on both benchmark and pneumonia diagnosis tasks show that our method is effective.

Downloads

Published

2023-06-26

How to Cite

Xie, Z., Sun, H., & Li, M. (2023). Semi-supervised Learning with Support Isolation by Small-Paced Self-Training. Proceedings of the AAAI Conference on Artificial Intelligence, 37(9), 10510-10518. https://doi.org/10.1609/aaai.v37i9.26249

Issue

Section

AAAI Technical Track on Machine Learning IV