Inverse Weight-Balancing for Deep Long-Tailed Learning

Authors

  • Wenqi Dang Xidian University
  • Zhou Yang Xidian University
  • Weisheng Dong Xidian University
  • Xin Li West Virginia University
  • Guangming Shi Xidian University Peng Cheng Laboratory

DOI:

https://doi.org/10.1609/aaai.v38i10.29055

Keywords:

ML: Classification and Regression, CV: Medical and Biological Imaging

Abstract

The performance of deep learning models often degrades rapidly when faced with imbalanced data characterized by a long-tailed distribution. Researchers have found that the fully connected layer trained by cross-entropy loss has large weight-norms for classes with many samples, but not for classes with few samples. How to address the data imbalance problem with both the encoder and the classifier seems an under-researched problem. In this paper, we propose an inverse weight-balancing (IWB) approach to guide model training and alleviate the data imbalance problem in two stages. In the first stage, an encoder and classifier (the fully connected layer) are trained using conventional cross-entropy loss. In the second stage, with a fixed encoder, the classifier is finetuned through an adaptive distribution for IWB in the decision space. Unlike existing inverse image frequency that implements a multiplicative margin adjustment transformation in the classification layer, our approach can be interpreted as an adaptive distribution alignment strategy using not only the class-wise number distribution but also the sample-wise difficulty distribution in both encoder and classifier. Experiments show that our method can greatly improve performance on imbalanced datasets such as CIFAR100-LT with different imbalance factors, ImageNet-LT, and iNaturelists2018.

Published

2024-03-24

How to Cite

Dang, W., Yang, Z., Dong, W., Li, X., & Shi, G. (2024). Inverse Weight-Balancing for Deep Long-Tailed Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(10), 11713-11721. https://doi.org/10.1609/aaai.v38i10.29055

Issue

Section

AAAI Technical Track on Machine Learning I