Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Authors

  • Haixin Shi EPFL
  • Yinlin Hu Magic Leap
  • Daniel Koguciuk Magic Leap
  • Juan-Ting Lin Magic Leap
  • Mathieu Salzmann EPFL
  • David Ferstl Magic Leap

DOI:

https://doi.org/10.1609/aaai.v39i7.32736

Abstract

We propose an approach for reconstructing free-moving object from a monocular RGB video. Most existing methods either assume scene prior, hand pose prior, object category pose prior, or rely on local optimization with multiple sequence segments. We propose a method that allows free interaction with the object in front of a moving camera without relying on any prior, and optimizes the sequence globally without any segments. We progressively optimize the object shape and pose simultaneously based on an implicit neural representation. A key aspect of our method is a virtual camera system that reduces the search space of the optimization significantly. We evaluate our method on the standard HO3D dataset and a collection of egocentric RGB sequences captured with a head-mounted device. We demonstrate that our approach outperforms most methods significantly, and is on par with recent techniques that assume prior information.

Published

2025-04-11

How to Cite

Shi, H., Hu, Y., Koguciuk, D., Lin, J.-T., Salzmann, M., & Ferstl, D. (2025). Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 6860–6868. https://doi.org/10.1609/aaai.v39i7.32736

Issue

Section

AAAI Technical Track on Computer Vision VI