Decoupled Optimisation for Long-Tailed Visual Recognition

Authors

  • Cong Cong School of Computer Science and Engineering, University of New South Wales, Sydney, Australia
  • Shiyu Xuan National Key Laboratory for Multimedia Information Processing, School of Computer Science, Peking University, China
  • Sidong Liu Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
  • Shiliang Zhang National Key Laboratory for Multimedia Information Processing, School of Computer Science, Peking University, China
  • Maurice Pagnucco School of Computer Science and Engineering, University of New South Wales, Sydney, Australia
  • Yang Song School of Computer Science and Engineering, University of New South Wales, Sydney, Australia

DOI:

https://doi.org/10.1609/aaai.v38i2.27902

Keywords:

CV: Object Detection & Categorization, ML: Multi-class/Multi-label Learning & Extreme Classification

Abstract

When training on a long-tailed dataset, conventional learning algorithms tend to exhibit a bias towards classes with a larger sample size. Our investigation has revealed that this biased learning tendency originates from the model parameters, which are trained to disproportionately contribute to the classes characterised by their sample size (e.g., many, medium, and few classes). To balance the overall parameter contribution across all classes, we investigate the importance of each model parameter to the learning of different class groups, and propose a multistage parameter Decouple and Optimisation (DO) framework that decouples parameters into different groups with each group learning a specific portion of classes. To optimise the parameter learning, we apply different training objectives with a collaborative optimisation step to learn complementary information about each class group. Extensive experiments on long-tailed datasets, including CIFAR100, Places-LT, ImageNet-LT, and iNaturaList 2018, show that our framework achieves competitive performance compared to the state-of-the-art.

Published

2024-03-24

How to Cite

Cong, C., Xuan, S., Liu, S., Zhang, S., Pagnucco, M., & Song, Y. (2024). Decoupled Optimisation for Long-Tailed Visual Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1380–1388. https://doi.org/10.1609/aaai.v38i2.27902

Issue

Section

AAAI Technical Track on Computer Vision I