Sampling Control for Imbalanced Calibration in Semi-Supervised Learning

Authors

  • Senmao Tian Beijing Jiaotong University
  • Xiang Wei Beijing Jiaotong University
  • Shunli Zhang Beijing Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v40i31.39791

Abstract

Class imbalance remains a critical challenge in semi-supervised learning (SSL), especially when distributional mismatches between labeled and unlabeled data lead to biased classification. Although existing methods address this issue by adjusting logits based on the estimated class distribution of unlabeled data, they often handle model imbalance in a coarse-grained manner, conflating data imbalance with bias arising from varying class-specific learning difficulties. To address this issue, we propose a unified framework, SC-SSL, which suppresses model bias through decoupled sampling control. During training, we identify the key variables for sampling control under ideal conditions. By introducing a classifier with explicit expansion capability and adaptively adjusting sampling probabilities across different data distributions, SC-SSL mitigates feature-level imbalance for minority classes. In the inference phase, we further analyze the weight imbalance of the linear classifier and apply post-hoc sampling control with an optimization bias vector to directly calibrate the logits. Extensive experiments across various benchmark datasets and distribution settings validate the consistency and state-of-the-art performance of SC-SSL.

Downloads

Published

2026-03-14

How to Cite

Tian, S., Wei, X., & Zhang, S. (2026). Sampling Control for Imbalanced Calibration in Semi-Supervised Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 25914–25922. https://doi.org/10.1609/aaai.v40i31.39791

Issue

Section

AAAI Technical Track on Machine Learning VIII