Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement

Authors

  • Jing Wang School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China
  • Jiangyun Li School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China
  • Chen Chen Center for Research in Computer Vision, University of Central Florida, Orlando, USA
  • Yisi Zhang School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China
  • Haoran Shen School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China
  • Tianxiang Zhang School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v38i6.28355

Keywords:

CV: Segmentation, ML: Transfer, Domain Adaptation, Multi-Task Learning

Abstract

The Few-Shot Segmentation (FSS) aims to accomplish the novel class segmentation task with a few annotated images. Current FSS research based on meta-learning focuses on designing a complex interaction mechanism between the query and support feature. However, unlike humans who can rapidly learn new things from limited samples, the existing approach relies solely on fixed feature matching to tackle new tasks, lacking adaptability. In this paper, we propose a novel framework based on the adapter mechanism, namely Adaptive FSS, which can efficiently adapt the existing FSS model to the novel classes. In detail, we design the Prototype Adaptive Module (PAM), which utilizes accurate category information provided by the support set to derive class prototypes, enhancing class-specific information in the multi-stage representation. In addition, our approach is compatible with diverse FSS methods with different backbones by simply inserting PAM between the layers of the encoder. Experiments demonstrate that our method effectively improves the performance of the FSS models (e.g., MSANet, HDMNet, FPTrans, and DCAMA) and achieves new state-of-the-art (SOTA) results (i.e., 72.4% and 79.1% mIoU on PASCAL-5i 1-shot and 5-shot settings, 52.7% and 60.0% mIoU on COCO-20i 1-shot and 5-shot settings). Our code is available at https://github.com/jingw193/AdaptiveFSS.

Published

2024-03-24

How to Cite

Wang, J., Li, J., Chen, C., Zhang, Y., Shen, H., & Zhang, T. (2024). Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5463-5471. https://doi.org/10.1609/aaai.v38i6.28355

Issue

Section

AAAI Technical Track on Computer Vision V