FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron Segmentation

Authors

  • Zhenghua Li Department of Computer Science and Technology, Institute for AI, BNRist, Tsinghua University, Beijing 100084, China Tsinghua Laboratory of Brain and Intelligence (THBI), IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
  • Hang Chen Department of Computer Science and Technology, Institute for AI, BNRist, Tsinghua University, Beijing 100084, China Tsinghua Laboratory of Brain and Intelligence (THBI), IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
  • Zihao Sun Zhili College, Tsinghua University, Beijing 100084, China
  • Kai Li Department of Computer Science and Technology, Institute for AI, BNRist, Tsinghua University, Beijing 100084, China Tsinghua Laboratory of Brain and Intelligence (THBI), IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
  • Xiaolin Hu Department of Computer Science and Technology, Institute for AI, BNRist, Tsinghua University, Beijing 100084, China Tsinghua Laboratory of Brain and Intelligence (THBI), IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China Chinese Institute for Brain Research (CIBR), Beijing 100010, China

DOI:

https://doi.org/10.1609/aaai.v40i8.37606

Abstract

Accurate segmentation of neural structures in Electron Microscopy (EM) images is paramount for neuroscience. However, this task is challenged by intricate morphologies, low signal-to-noise ratios, and scarce annotations, limiting the accuracy and generalization of existing methods. To address these challenges, we seek to leverage the priors learned by visual foundation models on a vast amount of natural images to better tackle this task. Specifically, we propose a novel framework that can effectively transfer knowledge from Segment Anything 2 (SAM2), which is pre-trained on natural images, to the EM domain. We first use SAM2 to extract powerful, general-purpose features. To bridge the domain gap, we introduce a Feature-Guided Attention module that leverages semantic cues from SAM2 to guide a lightweight encoder, the Fine-Grained Encoder (FGE), in focusing on these challenging regions. Finally, a dual-affinity decoder generates both coarse and refined affinity maps. Experimental results demonstrate that our method achieves performance comparable to state-of-the-art (SOTA) approaches with the SAM2 weights frozen. Upon further fine-tuning on EM data, our method significantly outperforms existing SOTA methods. This study validates that transferring representations pre-trained on natural images, when combined with targeted domain-adaptive guidance, can effectively address the specific challenges in neuron segmentation.

Downloads

Published

2026-03-14

How to Cite

Li, Z., Chen, H., Sun, Z., Li, K., & Hu, X. (2026). FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6744–6752. https://doi.org/10.1609/aaai.v40i8.37606

Issue

Section

AAAI Technical Track on Computer Vision V