Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution

Tingxuan Huang; Jiacheng Miao; Shizhuo Deng; Tong Jia; Dongyue Chen

doi:10.1609/aaai.v39i4.32390

Authors

Tingxuan Huang Northeastern University
Jiacheng Miao Northeastern University
Shizhuo Deng Northeastern University
Tong Jia Northeastern University
Dongyue Chen Northeastern University

DOI:

https://doi.org/10.1609/aaai.v39i4.32390

Abstract

Most indoor depth completion tasks rely on convolutional auto-encoders to reconstruct depth images, especially in areas with significant missing values. While traditional convolution treats valid and missing pixels equally, Partial Convolution (PConv) has mitigated this limitation. However, PConv fails to distinguish the varying degree of invalidity across different missing areas, which highlights the need for a more refined strategy. To solve this problem, we propose a novel system for indoor depth completion tasks that leverages Mask-adaptive Gated Convolution (MagaConv). MagaConv utilizes gated signals to selectively apply convolution kernels based on the characteristics of missing depth data. These gating signals are generated using shared convolution kernels that jointly process depth features and corresponding masks, ensuring coherent weight optimization. Additionally, the mask undergoes iterative updates according to predefined rules. To improve the fusion of depth and color information, we introduce a Bi-directional Aligning Projection (Bid-AP) module, which utilizes a bi-directional projection scheme with global spatial-channel attention mechanisms to filter out depth-irrelevant features from other modalities. Extensive experiments on popular benchmarks, including NYU-Depth V2, DIML, and SUN RGB-D, demonstrate that our model outperforms state-of-the-art methods in both accuracy and efficiency.

Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information