Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction
DOI:
https://doi.org/10.1609/aaai.v37i2.25358Keywords:
CV: 3D Computer Vision, CV: Multi-modal Vision, CV: Vision for Robotics & Autonomous Driving, ML: Deep Neural Network AlgorithmsAbstract
In this work, we propose a real-time monocular 3D video reconstruction approach named Flora for reconstructing delicate and complete 3D scenes from RGB video sequences in an end-to-end manner. Specifically, we introduce a novel method with two main contributions. Firstly, the proposed feature aggregation module retains both color and reliability in a dual-frequency form. Secondly, the loss compensation module solves missing structure by correcting losses for falsely pruned voxels. The dual-frequency feature aggregation module enhances reconstruction quality in both precision and recall, and the loss compensation module benefits the recall. Notably, both proposed contributions achieve great results with negligible inferencing overhead. Our state-of-the-art experimental results on real-world datasets demonstrate Flora's leading performance in both effectiveness and efficiency. The code is available at https://github.com/NoOneUST/Flora.Downloads
Published
2023-06-26
How to Cite
Wang, L., Gong, Y., Wang, Q., Zhou, K., & Chen, L. (2023). Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2599-2607. https://doi.org/10.1609/aaai.v37i2.25358
Issue
Section
AAAI Technical Track on Computer Vision II