Toward Robustness in Multi-Label Classification: A Data Augmentation Strategy against Imbalance and Noise

Authors

  • Hwanjun Song KAIST
  • Minseok Kim Amazon
  • Jae-Gil Lee KAIST

DOI:

https://doi.org/10.1609/aaai.v38i19.30157

Keywords:

General

Abstract

Multi-label classification poses challenges due to imbalanced and noisy labels in training data. In this paper, we propose a unified data augmentation method, named BalanceMix, to address these challenges. Our approach includes two samplers for imbalanced labels, generating minority-augmented instances with high diversity. It also refines multi-labels at the label-wise granularity, categorizing noisy labels as clean, re-labeled, or ambiguous for robust optimization. Extensive experiments on three benchmark datasets demonstrate that BalanceMix outperforms existing state-of-the-art methods. We release the code at https://github.com/DISL-Lab/BalanceMix.

Published

2024-03-24

How to Cite

Song, H., Kim, M., & Lee, J.-G. (2024). Toward Robustness in Multi-Label Classification: A Data Augmentation Strategy against Imbalance and Noise. Proceedings of the AAAI Conference on Artificial Intelligence, 38(19), 21592-21601. https://doi.org/10.1609/aaai.v38i19.30157

Issue

Section

AAAI Technical Track on Safe, Robust and Responsible AI Track