Probability Guided Loss for Long-Tailed Multi-Label Image Classification

Authors

  • Dekun Lin Chengdu Institute of Computer Applications, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v37i2.25244

Keywords:

CV: Learning & Optimization for CV, CV: Applications, CV: Representation Learning for Vision

Abstract

Long-tailed learning has attracted increasing attention in very recent years. Long-tailed multi-label image classification is one subtask and remains challenging and poorly researched. In this paper, we provide a fresh perspective from probability to tackle this problem. More specifically, we find that existing cost-sensitive learning methods for long-tailed multi-label classification will affect the predicted probability of positive and negative labels in varying degrees during training, and different processes of probability will affect the final performance in turn. We thus propose a probability guided loss which contains two components to control this process. One is the probability re-balancing which can flexibly adjust the process of training probability. And the other is the adaptive probability-aware focal which can further reduce the probability gap between positive and negative labels. We conduct extensive experiments on two long-tailed multi-label image classification datasets: VOC-LT and COCO-LT. The results demonstrate the rationality and superiority of our strategy.

Downloads

Published

2023-06-26

How to Cite

Lin, D. (2023). Probability Guided Loss for Long-Tailed Multi-Label Image Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 1577-1585. https://doi.org/10.1609/aaai.v37i2.25244

Issue

Section

AAAI Technical Track on Computer Vision II