AoP-SAM: Automation of Prompts for Efficient Segmentation

Authors

  • Yi Chen KAIST
  • Muyoung Son KAIST
  • Chuanbo Hua KAIST
  • Joo-Young Kim KAIST

DOI:

https://doi.org/10.1609/aaai.v39i2.32228

Abstract

The Segment Anything Model (SAM) is a powerful foundation model for image segmentation, showing robust zero-shot generalization through prompt engineering. However, relying on manual prompts is impractical for real-world applications, particularly in scenarios where rapid prompt provision and resource efficiency are crucial. In this paper, we propose the Automation of Prompts for SAM (AoP-SAM), a novel approach that learns to generate essential prompts in optimal locations automatically. AoP-SAM enhances SAM’s efficiency and usability by eliminating manual input, making it better suited for real-world tasks. Our approach employs a lightweight yet efficient Prompt Predictor model that detects key entities across images and identifies the optimal regions for placing prompt candidates. This method leverages SAM’s image embeddings, preserving its zero-shot generalization capabilities without requiring fine-tuning. Additionally, we introduce a test-time instance-level Adaptive Sampling and Filtering mechanism that generates prompts in a coarse-to-fine manner. This notably enhances both prompt and mask generation efficiency by reducing computational overhead and minimizing redundant mask refinements. Evaluations of three datasets demonstrate that AoP-SAM substantially improves both prompt generation efficiency and mask generation accuracy, making SAM more effective for automated segmentation tasks.

Downloads

Published

2025-04-11

How to Cite

Chen, Y., Son, M., Hua, C., & Kim, J.-Y. (2025). AoP-SAM: Automation of Prompts for Efficient Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(2), 2284–2292. https://doi.org/10.1609/aaai.v39i2.32228

Issue

Section

AAAI Technical Track on Computer Vision I