Is the Information Bottleneck Robust Enough? Towards Label-Noise Resistant Information Bottleneck Learning

Authors

  • Yi Huang SKLCCSE, School of Computer Science and Engineering, Beihang University, Beijing, China
  • Qingyun Sun SKLCCSE, School of Computer Science and Engineering, Beihang University, Beijing, China
  • Yisen Gao Department of Computer Science and Engineering, HKUST, Hong Kong, China
  • Haonan Yuan SKLCCSE, School of Computer Science and Engineering, Beihang University, Beijing, China
  • Xingcheng Fu Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, China
  • Jianxin Li SKLCCSE, School of Computer Science and Engineering, Beihang University, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v40i26.39363

Abstract

The Information Bottleneck (IB) principle facilitates effective representation learning by preserving label-relevant information while compressing irrelevant information. However, its strong reliance on accurate labels makes it inherently vulnerable to label noise, prevalent in real-world scenarios, resulting in significant performance degradation and overfitting. To address this issue, we propose LaT-IB, a novel Label-Noise ResistanT Information Bottleneck method which introduces a "Minimal-Sufficient-Clean" (MSC) criterion. Instantiated as a mutual information regularizer to retain task-relevant information while discarding noise, MSC addresses standard IB’s vulnerability to noisy label supervision. To achieve this, LaT-IB employs a noise-aware latent disentanglement that decomposes the latent representation into components aligned with to the clean label space and the noise space. Theoretically, we first derive mutual information bounds for each component of our objective including prediction, compression, and disentanglement, and moreover prove that optimizing it encourages representations invariant to input noise and separates clean and noisy label information. Furthermore, we design a three-phase training framework: Warmup, Knowledge Injection and Robust Training, to progressively guide the model toward noise-resistant representations. Extensive experiments demonstrate that LaT-IB achieves superior robustness and efficiency under label noise, significantly enhancing robustness and applicability in real-world scenarios with label noise.

Downloads

Published

2026-03-14

How to Cite

Huang, Y., Sun, Q., Gao, Y., Yuan, H., Fu, X., & Li, J. (2026). Is the Information Bottleneck Robust Enough? Towards Label-Noise Resistant Information Bottleneck Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 22084–22091. https://doi.org/10.1609/aaai.v40i26.39363

Issue

Section

AAAI Technical Track on Machine Learning III