Structure Flow-Guided Network for Real Depth Super-resolution

Authors

  • Jiayi Yuan Nanjing University of Science and Technology
  • Haobo Jiang Nanjing University of Science and Technology
  • Xiang Li Nanjing University of Science and Technology
  • Jianjun Qian Nanjing University of Science and Technology
  • Jun Li Nanjing University of Science and Technology
  • Jian Yang Nanjing University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i3.25441

Keywords:

CV: 3D Computer Vision, CV: Multi-modal Vision

Abstract

Real depth super-resolution (DSR), unlike synthetic settings, is a challenging task due to the structural distortion and the edge noise caused by the natural degradation in real-world low-resolution (LR) depth maps. These defeats result in significant structure inconsistency between the depth map and the RGB guidance, which potentially confuses the RGB-structure guidance and thereby degrades the DSR quality. In this paper, we propose a novel structure flow-guided DSR framework, where a cross-modality flow map is learned to guide the RGB-structure information transferring for precise depth upsampling. Specifically, our framework consists of a cross-modality flow-guided upsampling network (CFUNet) and a flow-enhanced pyramid edge attention network (PEANet). CFUNet contains a trilateral self-attention module combining both the geometric and semantic correlations for reliable cross-modality flow learning. Then, the learned flow maps are combined with the grid-sampling mechanism for coarse high-resolution (HR) depth prediction. PEANet targets at integrating the learned flow map as the edge attention into a pyramid network to hierarchically learn the edge-focused guidance feature for depth edge refinement. Extensive experiments on real and synthetic DSR datasets verify that our approach achieves excellent performance compared to state-of-the-art methods. Our code is available at: https://github.com/Yuanjiayii/DSR-SFG.

Downloads

Published

2023-06-26

How to Cite

Yuan, J., Jiang, H., Li, X., Qian, J., Li, J., & Yang, J. (2023). Structure Flow-Guided Network for Real Depth Super-resolution. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3340-3348. https://doi.org/10.1609/aaai.v37i3.25441

Issue

Section

AAAI Technical Track on Computer Vision III