Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera
DOI:
https://doi.org/10.1609/aaai.v39i7.32736Abstract
We propose an approach for reconstructing free-moving object from a monocular RGB video. Most existing methods either assume scene prior, hand pose prior, object category pose prior, or rely on local optimization with multiple sequence segments. We propose a method that allows free interaction with the object in front of a moving camera without relying on any prior, and optimizes the sequence globally without any segments. We progressively optimize the object shape and pose simultaneously based on an implicit neural representation. A key aspect of our method is a virtual camera system that reduces the search space of the optimization significantly. We evaluate our method on the standard HO3D dataset and a collection of egocentric RGB sequences captured with a head-mounted device. We demonstrate that our approach outperforms most methods significantly, and is on par with recent techniques that assume prior information.Downloads
Published
2025-04-11
How to Cite
Shi, H., Hu, Y., Koguciuk, D., Lin, J.-T., Salzmann, M., & Ferstl, D. (2025). Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 6860–6868. https://doi.org/10.1609/aaai.v39i7.32736
Issue
Section
AAAI Technical Track on Computer Vision VI