From Parameter to Representation: A Closed-Form Approach for Controllable Model Merging

Authors

  • Jialin Wu Department of Computer, Rocket Force University of Engineering
  • Jian Yang Department of Engineering, Rocket Force University of Engineering
  • Handing Wang School of Artificial Intelligence, Xidian University
  • Jiajun Wen Department of Computer, Rocket Force University of Engineering
  • Zhiyong Yu Department of Computer, Rocket Force University of Engineering

DOI:

https://doi.org/10.1609/aaai.v40i32.39902

Abstract

Model merging combines expert models for multitask performance but faces challenges from parameter interference. This has sparked recent interest in controllable model merging, giving users the ability to explicitly balance performance trade-offs. Existing approaches employ a compile-then-query paradigm, performing a costly offline multi-objective optimization to enable fast, preference-aware model generation. This offline stage typically involves iterative search or dedicated training, with complexity that grows exponentially with the number of tasks. To overcome these limitations, we shift the perspective from parameter-space optimization to a direct correction of the model's final representation. Our approach models this correction as an optimal linear transformation, yielding a closed-form solution that replaces the entire offline optimization process with a single-step, architecture-agnostic computation. This solution directly incorporates user preferences, allowing a Pareto-optimal model to be generated on-the-fly with complexity that scales linearly with the number of tasks. Experimental results show our method generates a superior Pareto front with more precise preference alignment and drastically reduced computational cost.

Downloads

Published

2026-03-14

How to Cite

Wu, J., Yang, J., Wang, H., Wen, J., & Yu, Z. (2026). From Parameter to Representation: A Closed-Form Approach for Controllable Model Merging. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 26903–26912. https://doi.org/10.1609/aaai.v40i32.39902

Issue

Section

AAAI Technical Track on Machine Learning IX