Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

Authors

  • Jae Young Lee, KAIST
  • Woonghyun Ka, Hyundai Motor Company
  • Jaehyun Choi, KAIST
  • Junmo Kim, KAIST

DOI:

https://doi.org/10.1609/aaai.v38i4.28071

Keywords:

CV: 3D Computer Vision, APP: Mobility, Driving & Flight, CV: Vision for Robotics & Autonomous Driving

Abstract

We propose a novel stereo-confidence measure that can be computed externally to various stereo-matching networks, offering an alternative to the cost volume as an input modality for learning-based approaches, especially in safety-critical systems. Grounded in the definition of disparity and the disparity plane sweep, the proposed stereo-confidence method builds on the idea that any shift applied to a stereo-image pair should be reflected as a shift of corresponding amount in the disparity map. Based on this idea, the method can be summarized in three steps. 1) Using the disparity plane sweep, multiple disparity maps are obtained and treated as a 3-D volume (the predicted disparity volume), analogous to how a cost volume is constructed. 2) One of these disparity maps serves as an anchor, allowing us to define a desirable (or ideal) disparity profile at every spatial point. 3) By comparing the desirable and predicted disparity profiles, we quantify the level of matching ambiguity between the left and right images for confidence measurement. Extensive experiments with various stereo-matching networks and datasets demonstrate that the proposed stereo-confidence method not only performs competitively on its own but also yields consistent performance improvements when used as an input modality for learning-based stereo-confidence methods.
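The three steps in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`predicted_disparity_volume`, `profile_confidence`), the wrap-around shift via `np.roll`, the mean absolute deviation, the exponential mapping, and the temperature `tau` are all assumptions chosen for illustration, since the abstract does not specify how the two profiles are compared. The network is treated as a black-box callable `stereo_net(left, right)` that returns an (H, W) disparity map.

```python
import numpy as np

def predicted_disparity_volume(stereo_net, left, right, shifts):
    """Step 1: run a black-box network once per horizontal shift of the right image.

    Assumed convention: disparity = x_left - x_right, so moving the right-image
    content s pixels to the left should raise every disparity by s.
    """
    maps = []
    for s in shifts:
        # np.roll wraps around at the border; this is a simplification,
        # and a real pipeline would more likely pad or crop instead.
        shifted_right = np.roll(right, -int(s), axis=1)
        maps.append(stereo_net(left, shifted_right))
    return np.stack(maps, axis=0)  # shape (S, H, W)

def profile_confidence(volume, shifts, anchor_index, tau=1.0):
    """Steps 2 and 3: build the ideal profile from an anchor map and compare.

    The comparison used here (mean absolute deviation mapped through exp)
    is a hypothetical choice, not the measure defined in the paper.
    """
    shifts = np.asarray(shifts, dtype=np.float64)
    anchor = volume[anchor_index]  # (H, W) anchor disparity map
    # Ideal profile: the anchor disparity offset by each relative shift.
    ideal = anchor[None] + (shifts - shifts[anchor_index])[:, None, None]
    deviation = np.abs(volume - ideal).mean(axis=0)  # (H, W)
    return np.exp(-deviation / tau)  # 1.0 where the profiles agree exactly
```

Under these assumptions, a pixel whose predicted disparity follows the applied shift exactly receives a confidence of 1, while a pixel whose prediction ignores the shift (a sign of matching ambiguity) receives a lower value.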

Published

2024-03-24

How to Cite

Lee, J. Y., Ka, W., Choi, J., & Kim, J. (2024). Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 2901-2910. https://doi.org/10.1609/aaai.v38i4.28071

Section

AAAI Technical Track on Computer Vision III