Adapt Before Continual Learning

Authors

  • Aojun Lu College of Computer Science, Sichuan University, Chengdu, China
  • Tao Feng Department of Computer Science and Technology, Tsinghua University, Beijing, China
  • Hangjie Yuan College of Computer Science and Technology, Zhejiang University, Hangzhou, China
  • Chunhui Ding College of Computer Science, Sichuan University, Chengdu, China
  • Yanan Sun College of Computer Science, Sichuan University, Chengdu, China

DOI:

https://doi.org/10.1609/aaai.v40i29.39584

Abstract

Continual Learning (CL) seeks to enable neural networks to incrementally acquire new knowledge (plasticity) while retaining existing knowledge (stability). Although pre-trained models (PTMs) have provided a strong foundation for CL, existing approaches face a fundamental challenge in balancing these two competing objectives. Current methods typically address stability by freezing the PTM backbone, which severely limits the model's plasticity, particularly when incoming data distribution diverges largely from the pre-training data. Alternatively, sequentially fine-tuning the entire PTM can adapt to new knowledge but often leads to catastrophic forgetting, highlighting the critical stability-plasticity trade-off in PTM-based CL. To address this limitation, we propose Adapting PTMs before the core CL process (ACL), a novel framework that introduces a plug-and-play adaptation phase prior to learning each new task. During this phase, ACL refines the PTM backbone by aligning embeddings with their original class prototypes while distancing them from irrelevant classes. This mechanism theoretically and empirically demonstrates desirable balance between stability and plasticity, significantly improving CL performance across benchmarks and integrated methods.

Downloads

Published

2026-03-14

How to Cite

Lu, A., Feng, T., Yuan, H., Ding, C., & Sun, Y. (2026). Adapt Before Continual Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(29), 24061-24069. https://doi.org/10.1609/aaai.v40i29.39584

Issue

Section

AAAI Technical Track on Machine Learning VI