RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning

Authors

  • Haorong Han Key Laboratory of Big Data and Artificial Intelligence in Transportation, Ministry of Education, China Beijing Jiaotong University
  • Jidong Yuan Key Laboratory of Big Data and Artificial Intelligence in Transportation, Ministry of Education, China Beijing Jiaotong University
  • Chixuan Wei Communication University of China
  • Zhongyang Yu Key Laboratory of Big Data and Artificial Intelligence in Transportation, Ministry of Education, China Beijing Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v39i16.33872

Abstract

Consistency regularization and pseudo-labeling have significantly advanced semi-supervised learning (SSL). Prior works have effectively employed Mixup for consistency regularization in SSL. However, our findings indicate that applying Mixup for consistency regularization may degrade SSL performance by compromising the purity of artificial labels. Moreover, most pseudo-labeling based methods utilize thresholding strategy to exclude low-confidence data, aiming to mitigate confirmation bias; however, this approach limits the utility of unlabeled samples. To address these challenges, we propose RegMixMatch, a novel framework that optimizes the use of Mixup with both high- and low-confidence samples in SSL. First, we introduce semi-supervised RegMixup, which effectively addresses reduced artificial labels purity by using both mixed samples and clean samples for training. Second, we develop a class-aware Mixup technique that integrates information from the top-2 predicted classes into low-confidence samples and their artificial labels, reducing the confirmation bias associated with these samples and enhancing their effective utilization. Experimental results demonstrate that RegMixMatch achieves state-of-the-art performance across various SSL benchmarks.

Downloads

Published

2025-04-11

How to Cite

Han, H., Yuan, J., Wei, C., & Yu, Z. (2025). RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(16), 17032–17040. https://doi.org/10.1609/aaai.v39i16.33872

Issue

Section

AAAI Technical Track on Machine Learning II