MSP-MVS: Multi-Granularity Segmentation Prior Guided Multi-View Stereo

Authors

  • Zhenlong Yuan Institute of Computing Technology, Chinese Academy of Sciences
  • Cong Liu Harbin Institute of Technology, Shenzhen
  • Fei Shen Nanjing University of Science and Technology
  • Zhaoxin Li Agricultural Information Institute, Chinese Academy of Agricultural Sciences Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs
  • Jinguo Luo Harbin Institute of Technology, Shenzhen
  • Tianlu Mao Institute of Computing Technology, Chinese Academy of Sciences
  • Zhaoqi Wang Institute of Computing Technology, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v39i9.33057

Abstract

Recently, patch deformation-based methods have demonstrated significant strength in multi-view stereo by adaptively expanding the reception field of patches to help reconstruct textureless areas. However, such methods mainly concentrate on searching for pixels without matching ambiguity (i.e., reliable pixels) when constructing deformed patches, while neglecting the deformation instability caused by unexpected edge-skipping, resulting in potential matching distortions. Addressing this, we propose MSP-MVS, a method introducing multi-granularity segmentation prior for edge-confined patch deformation. Specifically, to avoid unexpected edge-skipping, we first aggregate and further refine multi-granularity depth edges gained from Semantic-SAM as prior to guide patch deformation within depth-continuous (i.e., homogeneous) areas. Moreover, to address attention imbalance caused by edge-confined patch deformation, we implement adaptive equidistribution and disassemble-clustering of correlative reliable pixels (i.e., anchors), thereby promoting attention-consistent patch deformation. Finally, to prevent deformed patches from falling into local-minimum matching costs caused by the fixed sampling pattern, we introduce disparity-sampling synergistic 3D optimization to help identify global-minimum matching costs. Evaluations on ETH3D and Tanks & Temples benchmarks prove our method obtains state-of-the-art performance with remarkable generalization.

Published

2025-04-11

How to Cite

Yuan, Z., Liu, C., Shen, F., Li, Z., Luo, J., Mao, T., & Wang, Z. (2025). MSP-MVS: Multi-Granularity Segmentation Prior Guided Multi-View Stereo. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9753–9762. https://doi.org/10.1609/aaai.v39i9.33057

Issue

Section

AAAI Technical Track on Computer Vision VIII