Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction

Authors

  • Likang Wang Department of Computer Science and Engineering, The Hong Kong University of Science and Technology
  • Yue Gong Distributed and Parallel Software Lab, Huawei Technologies
  • Qirui Wang Distributed and Parallel Software Lab, Huawei Technologies
  • Kaixuan Zhou Riemann Lab, Huawei Technologies Fundamental Software Innovation Lab, Huawei Technologies
  • Lei Chen Department of Computer Science and Engineering, The Hong Kong University of Science and Technology Data Science and Analytics Thrust, The Hong Kong University of Science and Technology (Guangzhou)

DOI:

https://doi.org/10.1609/aaai.v37i2.25358

Keywords:

CV: 3D Computer Vision, CV: Multi-modal Vision, CV: Vision for Robotics & Autonomous Driving, ML: Deep Neural Network Algorithms

Abstract

In this work, we propose a real-time monocular 3D video reconstruction approach named Flora for reconstructing delicate and complete 3D scenes from RGB video sequences in an end-to-end manner. Specifically, we introduce a novel method with two main contributions. Firstly, the proposed feature aggregation module retains both color and reliability in a dual-frequency form. Secondly, the loss compensation module solves missing structure by correcting losses for falsely pruned voxels. The dual-frequency feature aggregation module enhances reconstruction quality in both precision and recall, and the loss compensation module benefits the recall. Notably, both proposed contributions achieve great results with negligible inferencing overhead. Our state-of-the-art experimental results on real-world datasets demonstrate Flora's leading performance in both effectiveness and efficiency. The code is available at https://github.com/NoOneUST/Flora.

Downloads

Published

2023-06-26

How to Cite

Wang, L., Gong, Y., Wang, Q., Zhou, K., & Chen, L. (2023). Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2599-2607. https://doi.org/10.1609/aaai.v37i2.25358

Issue

Section

AAAI Technical Track on Computer Vision II