MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

Authors

  • Yanwu Yang Department of Psychiatry and Psychotherapy, University Hospital Tübingen, Tübingen, Germany German Center for Mental Health (DZPG), partner site Halle/Jena/Magdeburg, Germany
  • Guinan Su Max Planck Institute for Intelligent Systems
  • Jiesi Hu Harbin Institute of Technology at Shenzhen, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China
  • Francesco Sammarco Department of Psychiatry and Psychotherapy, University Hospital Tübingen, Tübingen, Germany
  • Jonas Geiping Max Planck Institute for Intelligent Systems ELLIS Institute Tübingen Tübingen AI Center
  • Thomas Wolfers Department of Psychiatry and Psychotherapy, University Hospital Tübingen, Tübingen, Germany German Center for Mental Health (DZPG), partner site Halle/Jena/Magdeburg, Germany Department of Psychology, Friedrich Schiller University of Jena, Germany

DOI:

https://doi.org/10.1609/aaai.v40i14.38161

Abstract

Universal medical image segmentation models have emerged as a promising paradigm due to their strong generalizability across diverse tasks, showing great potential for a wide range of clinical applications. This potential has been partly driven by the success of general-purpose vision models such as the Segment Anything Model (SAM), which has inspired the development of various fine-tuned variants for medical segmentation tasks. However, fine-tuned variants like MedSAM are trained on comparatively limited medical imaging data that often suffers from heterogeneity, scarce annotations, and distributional shifts. These challenges limit their ability to generalize across a wide range of medical segmentation tasks. In this regard, we propose MedSAMix, a training-free model merging method that integrates the strengths of both generalist models (e.g., SAM) and specialist models (e.g., MedSAM) for medical image segmentation. In contrast to traditional model merging approaches that rely on manual configuration and often result in suboptimal outcomes, we propose a zero-order optimization method to automatically discover optimal layer-wise merging solutions. Furthermore, for clinical applications, we develop two regimes to meet the demand of domain-specificity and generalizability in different scenarios by single-task optimization and multi-objective optimization respectively. Extensive evaluations on 25 medical segmentation tasks demonstrate that MedSAMix effectively mitigates model bias and consistently improves performance in both domain-specific accuracy and generalization, achieving improvements of 6.67% on specialized tasks and 4.37% on multi-task evaluations.

Downloads

Published

2026-03-14

How to Cite

Yang, Y., Su, G., Hu, J., Sammarco, F., Geiping, J., & Wolfers, T. (2026). MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 11757–11765. https://doi.org/10.1609/aaai.v40i14.38161

Issue

Section

AAAI Technical Track on Computer Vision XI