Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Authors

  • Sirui Liang The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of automation, Chinese academy of science, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China Zhongguancun Academy, Beijing, China
  • Pengfei Cao The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of automation, Chinese academy of science, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
  • Jian Zhao Zhongguancun Academy, Beijing, China Zhongguancun Institute of Artificial Intelligence, Beijing, China
  • Cong Huang Zhongguancun Academy, Beijing, China Zhongguancun Institute of Artificial Intelligence, Beijing, China
  • Jun Zhao The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of automation, Chinese academy of science, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
  • Kang Liu The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of automation, Chinese academy of science, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v40i38.40461

Abstract

Parameter-Efficient finetuning (PEFT) enhances model performance on downstream tasks by updating a minimal subset of parameters. Representation finetuning (ReFT) methods further improve efficiency by freezing model weights and optimizing internal representations with fewer parameters than PEFT, outperforming PEFT on several tasks. However, ReFT exhibits a significant performance decline on mathematical reasoning tasks. To address this problem, the paper demonstrates that ReFT's poor performance on mathematical tasks primarily stems from its struggle to generate effective reasoning prefixes during the early inference phase. Moreover, ReFT disturbs the numerical encoding and the error accumulats during the CoT stage. Based on these observations, this paper proposes Bias-REstrained Prefix Representation FineTuning (BREP ReFT), which enhances ReFT's mathematical reasoning capability by truncating training data to optimize the generation of initial reasoning prefixes, intervening on the early inference stage to prevent error accumulation, and constraining the intervention vectors' magnitude to avoid disturbing numerical encoding. Extensive experiments across diverse model architectures demonstrate BREP's superior effectiveness, efficiency, and robust generalization capability, outperforming both standard ReFT and weight-based PEFT methods on the task of mathematical reasoning.

Downloads

Published

2026-03-14

How to Cite

Liang, S., Cao, P., Zhao, J., Huang, C., Zhao, J., & Liu, K. (2026). Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 31916–31924. https://doi.org/10.1609/aaai.v40i38.40461

Issue

Section

AAAI Technical Track on Natural Language Processing III