Enhancing Robustness in Incremental Learning with Adversarial Training

Authors

  • Seungju Cho, Korea Advanced Institute of Science and Technology
  • Hongsin Lee, Korea Advanced Institute of Science and Technology
  • Changick Kim, Korea Advanced Institute of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v39i3.32254

Abstract

Adversarial training is one of the most effective approaches against adversarial attacks. However, adversarial training has primarily been studied in scenarios where data for all classes is provided, with limited research conducted in the context of incremental learning where knowledge is introduced sequentially. In this study, we investigate Adversarially Robust Class Incremental Learning (ARCIL), which deals with adversarial robustness in incremental learning. We first explore a series of baselines that integrate incremental learning with existing adversarial training methods, finding that they lead to conflicts between acquiring new knowledge and retaining past knowledge. Furthermore, we discover that training on new knowledge causes the disappearance of a key characteristic in robust models: a flat loss landscape in input space. To address such issues, we propose a novel and robust baseline for ARCIL, named FLatness preserving Adversarial Incremental learning for Robustness (FLAIR). Experimental results demonstrate that FLAIR significantly outperforms other baselines. To the best of our knowledge, we are the first to comprehensively investigate the baselines, challenges, and solutions for ARCIL, which we believe represents a significant advance toward achieving real-world robustness.
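
As a rough illustration of what a "flat loss landscape in input space" can mean, the sketch below measures how much the loss of a toy linear classifier rises under a single signed-gradient (FGSM-style) perturbation of the input. This is not the FLAIR method and does not come from the paper; the model, the function names (`logistic_loss`, `fgsm_perturb`, `flatness_gap`), the weights, and the perturbation budget `eps` are all assumptions chosen for illustration.

```python
import math

def logistic_loss(w, b, x, y):
    """Binary cross-entropy of a linear classifier; y is 0 or 1."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-z))
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))

def input_gradient(w, b, x, y):
    """Gradient of the loss with respect to the input x, which is (p - y) * w."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-z))
    return [(p - y) * wi for wi in w]

def fgsm_perturb(w, b, x, y, eps):
    """Move each input coordinate by eps in the direction that raises the loss."""
    g = input_gradient(w, b, x, y)
    return [xi + eps * ((gi > 0) - (gi < 0)) for xi, gi in zip(x, g)]

def flatness_gap(w, b, x, y, eps):
    """Loss increase under the perturbation; a smaller gap means a flatter loss."""
    x_adv = fgsm_perturb(w, b, x, y, eps)
    return logistic_loss(w, b, x_adv, y) - logistic_loss(w, b, x, y)

# Hypothetical toy setup: same input and label, two models that differ
# only in weight scale (illustrative values, not from the paper).
x, y, eps = [0.1, -0.1], 1, 0.1
steep_gap = flatness_gap([4.0, -4.0], 0.0, x, y, eps)
flat_gap = flatness_gap([1.0, -1.0], 0.0, x, y, eps)
print("steep model gap:", steep_gap)
print("flat model gap:", flat_gap)
```

In this toy setup the larger-weight model shows the larger loss increase for the same `eps`, which is the kind of sharpness in input space that the abstract associates with lost robustness. Tracking such a gap on earlier classes after each incremental step would be one simple, assumed way to observe the effect the abstract describes.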

Published

2025-04-11

How to Cite

Cho, S., Lee, H., & Kim, C. (2025). Enhancing Robustness in Incremental Learning with Adversarial Training. Proceedings of the AAAI Conference on Artificial Intelligence, 39(3), 2518–2526. https://doi.org/10.1609/aaai.v39i3.32254

Issue

Vol. 39 No. 3

Section

AAAI Technical Track on Computer Vision II