LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

Authors

  • Junsong Li, East China Normal University
  • Jie Zhou, East China Normal University
  • Bihao Zhan, East China Normal University
  • Yutao Yang, East China Normal University
  • Qianjun Pan, East China Normal University
  • Shilian Chen, East China Normal University
  • Tianyu Huai, East China Normal University
  • Xin Li, Shanghai Artificial Intelligence Laboratory
  • Qin Chen, East China Normal University
  • Liang He, East China Normal University

DOI

https://doi.org/10.1609/aaai.v40i37.40428

Abstract

Alignment plays a crucial role in adapting Large Language Models (LLMs) to human preferences on a specific task or domain. Traditional alignment methods suffer from catastrophic forgetting, where models lose previously learned values when adapting to new preferences or domains. We introduce LifeAlign, a novel framework for lifelong alignment that enables LLMs to maintain consistent human preference alignment across sequential learning tasks without forgetting previously learned values. Our approach consists of two key innovations. First, we propose a focalized preference optimization strategy that aligns LLMs with new preferences while preventing the erosion of alignment acquired from previous tasks. Second, we develop a short-to-long memory consolidation mechanism that merges denoised short-term preference representations into stable long-term memory using intrinsic dimensionality reduction, enabling efficient storage and retrieval of alignment patterns across diverse domains. We evaluate LifeAlign on multiple sequential alignment tasks spanning different domains and preference types. Experimental results demonstrate that our method outperforms existing lifelong learning approaches in maintaining both preference alignment quality and knowledge retention.

Published

2026-03-14

How to Cite

Li, J., Zhou, J., Zhan, B., Yang, Y., Pan, Q., Chen, S., … He, L. (2026). LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 40(37), 31618–31626. https://doi.org/10.1609/aaai.v40i37.40428

Section

AAAI Technical Track on Natural Language Processing II