Retriever Encoder Selection Matters for In-Context Learning-based Medical Segmentation

Authors

  • Fan Wang Shandong University
  • Zhongyi Han Shandong University
  • Yongshun Gong Shandong University
  • Yilong Yin Shandong University

DOI:

https://doi.org/10.1609/aaai.v40i31.39819

Abstract

In-context learning-based medical segmentation (ICLM) enables foundation models to generalize to unseen cases without retraining. To enhance performance on test queries, existing methods typically follow a two-stage process: (1) using a retrieval encoder (RE) to map both queries and training samples into a shared feature space, and (2) retrieving and utilizing the top-k most similar training samples. While current methods fix the RE and focus on optimizing stage (2), we show that the choice of RE in stage (1) alone can account for over 70% of the performance variation, highlighting RE selection as a critical yet often overlooked factor in ICLM. In this paper, we conduct an analysis of the RE selection and make two main findings: (1) dynamically selecting the RE for each query outperforms selecting a fixed RE for the entire task; and (2) feature-space heuristics (e.g., intra-class compactness and inter-class separability) fail to predict RE quality. To this end, we propose the instance-adaptive retrieval encoder selection (IRES) method that can select the optimal RE for each query based on output predictions. IRES is based on the intuition that a good RE retrieves relevant demonstrations, helping the ICL model generate more accurate and stable segmentation masks. Thus, we introduce the shape stability score (S³), which evaluates the morphological stability of predicted masks under iterative erosion. Experiments show S³ correlates strongly with true RE quality (Pearson > 0.8), serving as a reliable selection proxy. To reduce S³’s per-query cost, we propose parallel prediction with reciprocal neighbor reuse (P2R), which accelerates inference by parallelizing encoding and reusing encoder selections across reciprocal neighbors, avoiding redundant computation. Built on S³ and P2R, IRES improves ICLM performance across FUNDUS, Brain MRI, and Chest X-ray datasets, with up to 10.6% gain on fundus segmentation.

Downloads

Published

2026-03-14

How to Cite

Wang, F., Han, Z., Gong, Y., & Yin, Y. (2026). Retriever Encoder Selection Matters for In-Context Learning-based Medical Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 26160–26168. https://doi.org/10.1609/aaai.v40i31.39819

Issue

Section

AAAI Technical Track on Machine Learning VIII