PanFlow: Decoupled Motion Control for Panoramic Video Generation

Authors

  • Cheng Zhang Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia Building 4.0 CRC, Caulfield East, Victoria, Australia
  • Hanwen Liang The Edward S Rogers Sr. ECE Department, University of Toronto, Toronto, M5S3G8, Canada
  • Donny Y. Chen Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia
  • Qianyi Wu Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia
  • Konstantinos N. Plataniotis The Edward S Rogers Sr. ECE Department, University of Toronto, Toronto, M5S3G8, Canada
  • Camilo Cruz Gambardella Building 4.0 CRC, Caulfield East, Victoria, Australia Future Building Initiative, Monash University, Caulfield East, Victoria, Australia
  • Jianfei Cai Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia

DOI:

https://doi.org/10.1609/aaai.v40i15.38231

Abstract

Panoramic video generation has attracted growing attention due to its applications in virtual reality and immersive media. However, existing methods lack explicit motion control and struggle to generate scenes with large and complex motions. We propose PanFlow a novel approach that exploits the spherical nature of panoramas to decouple the highly dynamic camera rotation from the input optical flow condition, enabling more precise control over large and dynamic motions. We further introduce a spherical noise warping strategy to promote loop consistency in motion across panorama boundaries. To support effective training, we curate a large-scale, motion-rich panoramic video dataset with frame-level pose and flow annotations. We also showcase the effectiveness of our method in various applications, including motion transfer and video editing. Extensive experiments demonstrate that PanFlow significantly outperforms prior methods in motion fidelity, visual quality, and temporal coherence.

Published

2026-03-14

How to Cite

Zhang, C., Liang, H., Chen, D. Y., Wu, Q., Plataniotis, K. N., Gambardella, C. C., & Cai, J. (2026). PanFlow: Decoupled Motion Control for Panoramic Video Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12385–12393. https://doi.org/10.1609/aaai.v40i15.38231

Issue

Section

AAAI Technical Track on Computer Vision XII