LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

Junsong Li; Jie Zhou; Bihao Zhan; Yutao Yang; Qianjun Pan; Shilian Chen; Tianyu Huai; Xin Li; Qin Chen; Liang He

doi:10.1609/aaai.v40i37.40428

Authors

Junsong Li East China Normal University
Jie Zhou East China Normal University
Bihao Zhan East China Normal University
Yutao Yang East China Normal University
Qianjun Pan East China Normal University
Shilian Chen East China Normal University
Tianyu Huai East China Normal University
Xin Li Shanghai Artificial Intelligence Laboratory
Qin Chen East China Normal University
Liang He East China Normal University

DOI:

https://doi.org/10.1609/aaai.v40i37.40428

Abstract

Alignment plays a crucial role in Large Language Models (LLMs) in aligning with human preferences on a specific task/domain. Traditional alignment methods suffer from catastrophic forgetting, where models lose previously learned values when adapting to new preferences or domains. We introduce LifeAlign, a novel framework for lifelong alignment that enables LLMs to maintain consistent human preference alignment across sequential learning tasks without forgetting previously learned values. Our approach consists of two key innovations. First, we propose a focalized preference optimization strategy that aligns LLMs with new preferences while preventing the erosion of alignment acquired from previous tasks. Second, we develop a short-to-long memory consolidation mechanism that merges denoised short-term preference representations into stable long-term memory using intrinsic dimensionality reduction, enabling efficient storage and retrieval of alignment patterns across diverse domains. We evaluate LifeAlign across multiple sequential alignment tasks spanning different domains and preference types. Experimental results demonstrate that our method achieves superior performance in maintaining both preference alignment quality and knowledge retention compared to existing lifelong learning approaches.

LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information