SSE-SAM: Balancing Head and Tail Classes Gradually Through Stage-Wise SAM

Authors

  • Xingyu Lyu Key Lab. of Intelligent Information Processing, Institute of Computing Tech., CAS School of Computer Science and Tech., University of Chinese Academy of Sciences
  • Qianqian Xu Key Lab. of Intelligent Information Processing, Institute of Computing Tech., CAS
  • Zhiyong Yang School of Computer Science and Tech., University of Chinese Academy of Sciences
  • Shaojie Lyu Tencent Corporate
  • Qingming Huang School of Computer Science and Tech., University of Chinese Academy of Sciences Key Lab. of Intelligent Information Processing, Institute of Computing Tech., CAS BDKM, University of Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v39i18.34122

Abstract

Real-world datasets often exhibit a long-tailed distribution, where vast majority of classes known as tail classes have only few samples. Traditional methods tend to overfit on these tail classes. Recently, a new approach called Imbalanced SAM (ImbSAM) is proposed to leverage the generalization benefits of Sharpness-Aware Minimization (SAM) for long-tailed distributions. The main strategy is to merely enhance the smoothness of the loss function for tail classes. However, we argue that improving generalization in long-tail scenarios requires a careful balance between head and tail classes. We show that neither SAM nor ImbSAM alone can fully achieve this balance. For SAM, we prove that although it enhances the model's generalization ability by  escaping saddle point in the overall loss landscape, it does not effectively address this for tail-class losses. Conversely, while ImbSAM is more effective at avoiding saddle points in tail classes, the head classes are trained insufficiently, resulting in significant performance drops. Based on these insights, we propose Stage-wise Saddle Escaping SAM (SSE-SAM), which uses complementary strengths of ImbSAM and SAM in a phased approach. Initially, SSE-SAM follows the majority sample to avoid saddle points of the head-class loss. During the later phase, it focuses on tail-classes to help them escape saddle points. Our experiments confirm that SSE-SAM has better ability in escaping saddles both on head and tail classes, and shows performance improvements.

Downloads

Published

2025-04-11

How to Cite

Lyu, X., Xu, Q., Yang, Z., Lyu, S., & Huang, Q. (2025). SSE-SAM: Balancing Head and Tail Classes Gradually Through Stage-Wise SAM. Proceedings of the AAAI Conference on Artificial Intelligence, 39(18), 19278–19286. https://doi.org/10.1609/aaai.v39i18.34122

Issue

Section

AAAI Technical Track on Machine Learning IV