LoKI: Low-Damage Knowledge Implanting of Large Language Models

Runyu Wang; Peng Ping; Zhengyu Guo; Xiaoye Zhang; Quan Shi; Liting Zhou; Tianbo Ji

doi:10.1609/aaai.v40i39.40651

Authors

Runyu Wang School of Information Science and Technology, Nantong University
Peng Ping School of Transportation and Civil Engineering, Nantong University
Zhengyu Guo South China University of Technology
Xiaoye Zhang China Southern Power Grid Company Limited
Quan Shi School of Transportation and Civil Engineering, Nantong University
Liting Zhou Dublin City University
Tianbo Ji School of Transportation and Civil Engineering, Nantong University

DOI:

https://doi.org/10.1609/aaai.v40i39.40651

Abstract

Fine-tuning adapts pretrained models for specific tasks but poses the risk of catastrophic forgetting (CF), where critical knowledge from pretraining is overwritten. To address the issue of CF in a general-purpose framework, we propose Low-damage Knowledge Implanting (LoKI), a parameter-efficient fine-tuning (PEFT) technique that utilizes recent mechanistic understanding of how knowledge is stored in transformer architectures. We compare LoKI against state-of-the-art PEFT methods in two real-world fine-tuning scenarios. The results show that LoKI demonstrates significantly better preservation of general capabilities. At the same time, its task-specific performance is comparable to or even surpasses that of full parameter fine-tuning and these PEFT methods across various model architectures. Our work bridges the mechanistic insights of LLMs' knowledge storage with practical fine-tuning objectives, enabling an effective balance between task-specific adaptation and the retention of general-purpose capabilities.

LoKI: Low-Damage Knowledge Implanting of Large Language Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information