Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Authors

  • Junyoung Seo Korea Advanced Institute of Science & Technology
  • Jisang Han Korea Advanced Institute of Science & Technology
  • Jaewoo Jung Korea Advanced Institute of Science & Technology
  • Siyoon Jin Korea Advanced Institute of Science & Technology
  • JoungBin Lee Korea Advanced Institute of Science & Technology
  • Takuya Narihira Sony AI
  • Kazumi Fukuda Sony AI
  • Takashi Shibuya Sony AI
  • Donghoon Ahn Korea Advanced Institute of Science & Technology
  • Shoukang Hu Sony AI
  • Seungryong Kim Korea Advanced Institute of Science & Technology
  • Yuki Mitsufuji Sony AI Sony Group Corporation

DOI:

https://doi.org/10.1609/aaai.v40i11.37832

Abstract

We introduce a novel framework for video camera trajectory editing, enabling the re-synthesis of monocular videos along user-defined camera paths. This task is challenging due to its ill-posed nature and the limited multi-view video data for training. Traditional reconstruction methods struggle with extreme trajectory changes, and existing generative models for dynamic novel view synthesis cannot handle in-the-wild videos. Our approach consists of two steps: estimating temporally consistent geometry, and generative rendering guided by this geometry. By integrating geometric priors, the generative model focuses on synthesizing realistic details where the estimated geometry is uncertain. We eliminate the need for extensive 4D training data through a factorized fine-tuning framework that separately trains spatial and temporal components using multi-view image and video data. Our method outperforms baselines in producing plausible videos from novel camera trajectories, especially in extreme extrapolation scenarios on real-world footage.

Downloads

Published

2026-03-14

How to Cite

Seo, J., Han, J., Jung, J., Jin, S., Lee, J., Narihira, T., Fukuda, K., Shibuya, T., Ahn, D., Hu, S., Kim, S., & Mitsufuji, Y. (2026). Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 8787-8795. https://doi.org/10.1609/aaai.v40i11.37832

Issue

Section

AAAI Technical Track on Computer Vision VIII