Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution
DOI:
https://doi.org/10.1609/aaai.v39i4.32390Abstract
Most indoor depth completion tasks rely on convolutional auto-encoders to reconstruct depth images, especially in areas with significant missing values. While traditional convolution treats valid and missing pixels equally, Partial Convolution (PConv) has mitigated this limitation. However, PConv fails to distinguish the varying degree of invalidity across different missing areas, which highlights the need for a more refined strategy. To solve this problem, we propose a novel system for indoor depth completion tasks that leverages Mask-adaptive Gated Convolution (MagaConv). MagaConv utilizes gated signals to selectively apply convolution kernels based on the characteristics of missing depth data. These gating signals are generated using shared convolution kernels that jointly process depth features and corresponding masks, ensuring coherent weight optimization. Additionally, the mask undergoes iterative updates according to predefined rules. To improve the fusion of depth and color information, we introduce a Bi-directional Aligning Projection (Bid-AP) module, which utilizes a bi-directional projection scheme with global spatial-channel attention mechanisms to filter out depth-irrelevant features from other modalities. Extensive experiments on popular benchmarks, including NYU-Depth V2, DIML, and SUN RGB-D, demonstrate that our model outperforms state-of-the-art methods in both accuracy and efficiency.Downloads
Published
2025-04-11
How to Cite
Huang, T., Miao, J., Deng, S., Jia, T., & Chen, D. (2025). Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution. Proceedings of the AAAI Conference on Artificial Intelligence, 39(4), 3742-3750. https://doi.org/10.1609/aaai.v39i4.32390
Issue
Section
AAAI Technical Track on Computer Vision III