Rethinking Disparity: A Depth Range Free Multi-View Stereo Based on Disparity

Authors

  • Qingsong Yan Wuhan University, Wuhan, China The Hong Kong University of Science and Technology, Hong Kong SAR, China
  • Qiang Wang Harbin Institute of Technology (Shenzhen), Shenzhen, China
  • Kaiyong Zhao XGRIDS, Shenzhen, China
  • Bo Li The Hong Kong University of Science and Technology, Hong Kong SAR, China
  • Xiaowen Chu The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China The Hong Kong University of Science and Technology, Hong Kong SAR, China
  • Fei Deng Wuhan University

DOI:

https://doi.org/10.1609/aaai.v37i3.25413

Keywords:

CV: 3D Computer Vision, CV: Vision for Robotics & Autonomous Driving

Abstract

Existing learning-based multi-view stereo (MVS) methods rely on the depth range to build the 3D cost volume and may fail when the range is too large or unreliable. To address this problem, we propose a disparity-based MVS method based on the epipolar disparity flow (E-flow), called DispMVS, which infers the depth information from the pixel movement between two views. The core of DispMVS is to construct a 2D cost volume on the image plane along the epipolar line between each pair (between the reference image and several source images) for pixel matching and fuse uncountable depths triangulated from each pair by multi-view geometry to ensure multi-view consistency. To be robust, DispMVS starts from a randomly initialized depth map and iteratively refines the depth map with the help of the coarse-to-fine strategy. Experiments on DTUMVS and Tanks\&Temple datasets show that DispMVS is not sensitive to the depth range and achieves state-of-the-art results with lower GPU memory.

Downloads

Published

2023-06-26

How to Cite

Yan, Q., Wang, Q., Zhao, K., Li, B., Chu, X., & Deng, F. (2023). Rethinking Disparity: A Depth Range Free Multi-View Stereo Based on Disparity. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3091-3099. https://doi.org/10.1609/aaai.v37i3.25413

Issue

Section

AAAI Technical Track on Computer Vision III