DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately

Authors

  • Huiwen Wu Zhejiang Laboratory
  • Deyi Zhang Zhejiang Laboratory
  • Xiaohan Li Zhejiang Laboratory
  • Xiaogang Xu The Chinese University of Hong Kong
  • Jiafei Wu Zhejiang Laboratory
  • Zhe Liu Zhejiang Laboratory

DOI:

https://doi.org/10.1609/aaai.v39i26.34985

Abstract

The emergence of the large language model (LLM) has shown its superiority in a wide range of disciplines, including language understanding and translation, relational logic reasoning, and even partial differential equations solving. The transformer is the pervasive backbone architecture for the foundation model construction. It is vital to research how to adjust the Transformer architecture to achieve an end-to-end privacy guarantee in LLM fine-tuning. This paper investigates three potential information leaks during a federated fine-tuning procedure for LLM (FedLLM). Based on the potential information leakage, we insert two-stage randomness into FedLLM to provide an end-to-end privacy guarantee solution. The first stage is to train a gradient auto-encoder with a Gaussian random prior based on the statistical information of the gradients generated by local clients. The second stage is fine-tuning the overall LLM with a differential privacy guarantee by adopting appropriate Gaussian noises. We show our proposed method's efficiency and accuracy gains with several foundation models and two popular evaluation benchmarks. Furthermore, we present a comprehensive privacy analysis with Gaussian Differential Privacy (GDP) and Renyi Differential Privacy (RDP).

Downloads

Published

2025-04-11

How to Cite

Wu, H., Zhang, D., Li, X., Xu, X., Wu, J., & Liu, Z. (2025). DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately. Proceedings of the AAAI Conference on Artificial Intelligence, 39(26), 27706–27714. https://doi.org/10.1609/aaai.v39i26.34985

Issue

Section

AAAI Technical Track on AI Alignment