EC-MVSNet: Enhanced Cascaded Multi-View Stereo with Cross-Scale Relevance Integration

Shaoqian Wang; Jiadai Sun; Bin Fan; Qiang Wang; Bin Lu; Yuchao Dai

doi:10.1609/aaai.v40i12.37972

Authors

Shaoqian Wang Yanzhao Electric Power Laboratory of North China Electric Power University Hebei Key Laboratory of Knowledge Computing for Energy and Power
Jiadai Sun School of Electronics and Information, Northwestern Polytechnical University Baidu Inc.
Bin Fan School of Electronics and Information, Northwestern Polytechnical University
Qiang Wang Yanzhao Electric Power Laboratory of North China Electric Power University Hebei Key Laboratory of Knowledge Computing for Energy and Power
Bin Lu Yanzhao Electric Power Laboratory of North China Electric Power University Hebei Key Laboratory of Knowledge Computing for Energy and Power
Yuchao Dai School of Electronics and Information, Northwestern Polytechnical University

DOI:

https://doi.org/10.1609/aaai.v40i12.37972

Abstract

Cascade-based multi-scale architectures are currently the mainstream in Multi-view Stereo (MVS), achieving a balance between computational efficiency and reconstruction accuracy. However, existing cascade MVS methods suffer from significant limitations in cross-scale information utilization, where depth estimation processes operate independently across scales without fully exploiting the rich relevance between adjacent scales. To address this fundamental limitation, we propose an Enhanced Cascade Multi-View Stereo framework (EC-MVSNet), which introduces a novel cross-scale relevance integration strategy. Specifically, we introduce a Cross-Scale Feature-based Joint Construction (CFC) module to synergistically combine features from adjacent scales to build more reliable cost volumes. Additionally, a Cross-Scale Probability-guided Enhancement (CPE) module is proposed to propagate depth probability distributions across scales to guide cost volume enhancement. Furthermore, we propose a Monocular Feature-based Refinement (MFR) module to further enhance depth prediction accuracy by leveraging monocular priors. Extensive experiments demonstrate that EC-MVSNet achieves state-of-the-art performance on multiple benchmarks, validating the effectiveness of the cross-scale integration in improving MVS reconstruction quality.

EC-MVSNet: Enhanced Cascaded Multi-View Stereo with Cross-Scale Relevance Integration

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information