Neural Collapse-Informed Initialization with Perturbation Injection in Classification-based Metric Learning

Authors

  • Jinhee Park Korea Electronics Technology Institute, Korea Chung-Ang University, Korea
  • Hee Bin Yoo Seoul National University, Korea
  • Minjun Kim Chung-Ang University, Korea
  • Byoung-Tak Zhang Seoul National University, Korea
  • Junseok Kwon Chung-Ang University, Korea

DOI:

https://doi.org/10.1609/aaai.v40i10.37777

Abstract

Recent studies have revealed Neural Collapse (NC) in deep classifiers, where last-layer weights and features align into an equiangular tight frame (ETF), concentrating class information along specific embedding directions. However, conventional fine-tuning typically disregards this structure, initializing task-specific classifier heads randomly. To explicitly leverage this phenomenon, we propose a simple yet effective method for metric learning: (1) initializing the classifier head along each class’s NC direction from a pretrained model to preserve the emergent structure, and (2) injecting small isotropic Gaussian noise during finetuning to boost generalization. In addition, we provide a theoretical bound proving that our method explicitly reduces cumulative weight drift from the NC-initialization, compared to standard finetuning. This suggests that our method better preserves the pretrained model’s class-specific structure. Empirically, this structural preservation yields Recall@K gains: reduced weight drift correlates with better performance. Concurrent decreases in the Neural Collapse 1 (NC1) measure confirm that stronger intra‐class cohesion underlies these improvements. Furthermore, we validate the effectiveness of our method on class‐imbalanced benchmarks.

Published

2026-03-14

How to Cite

Park, J., Yoo, H. B., Kim, M., Zhang, B.-T., & Kwon, J. (2026). Neural Collapse-Informed Initialization with Perturbation Injection in Classification-based Metric Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(10), 8287–8295. https://doi.org/10.1609/aaai.v40i10.37777

Issue

Section

AAAI Technical Track on Computer Vision VII