VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation

Authors

  • De-Xing Huang Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Xiao-Hu Zhou Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Mei-Jiang Gui Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Xiao-Liang Xie Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Shi-Qi Liu Institute of Automation, Chinese Academy of Sciences
  • Shuang-Yi Wang Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Tian-Yu Xiang Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences
  • Rui-Ze Ma Institute of Automation, Chinese Academy of Sciences
  • Nu-Fang Xiao Institute of Automation, Chinese Academy of Sciences
  • Zeng-Guang Hou Institute of Automation, Chinese Academy of Sciences School of Artificial Intelligence, University of Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v40i6.42503

Abstract

Accurate vessel segmentation in X-ray angiograms is crucial for numerous clinical applications. However, the scarcity of annotated data presents a significant challenge, which has driven the adoption of self-supervised learning (SSL) methods such as masked image modeling (MIM) to leverage large-scale unlabeled data for learning transferable representations. Unfortunately, conventional MIM often fails to capture vascular anatomy because of the severe class imbalance between vessel and background pixels, leading to weak vascular representations. To address this, we introduce Vascular anatomy-aware Masked Image Modeling (VasoMIM), a novel MIM framework tailored for X-ray angiograms that explicitly integrates anatomical knowledge into the pre-training process. Specifically, it comprises two complementary components: anatomy-guided masking strategy and anatomical consistency loss. The former preferentially masks vessel-containing patches to focus the model on reconstructing vessel-relevant regions. The latter enforces consistency in vascular semantics between the original and reconstructed images, thereby improving the discriminability of vascular representations. Empirically, VasoMIM achieves state-of-the-art performance across three datasets. These findings highlight its potential to facilitate X-ray angiogram analysis.

Downloads

Published

2026-03-14

How to Cite

Huang, D.-X., Zhou, X.-H., Gui, M.-J., Xie, X.-L., Liu, S.-Q., Wang, S.-Y., … Hou, Z.-G. (2026). VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(6), 4985–4993. https://doi.org/10.1609/aaai.v40i6.42503

Issue

Section

AAAI Technical Track on Computer Vision III