360Explorer: Exploring 4D Controllable World in Panoramic Videos

Authors

  • Xinhua Cheng Peking University
  • Haiyang Zhou Peking University
  • Wangbo Yu Peking University
  • Tanghui Jia Peking University
  • Bin Lin Peking University
  • Yunyang Ge Peking University
  • Weiqi Li Peking University
  • Li Yuan Peking University

DOI:

https://doi.org/10.1609/aaai.v40i5.37325

Abstract

We present 360Explorer, a novel approach for generating 4D controllable panoramic videos conditioned on user-provided 3D instructions for exploring and manipulating dynamic worlds. Compared to existing perspective-based methods struggle to address spatial consistency during camera rotation in place, we introduce the panoramic view in controllable video generation models to inherently maintain the view recall consistency. By introducing dynamic point clouds as the 4D scene representations, 360Explorer unifies the modeling of camera transformations and object movements as incomplete renders to describe precise control instructions in 3D worlds. To tackle the data limitation in acquiring multi-viewpoint panoramic videos, we further propose a reverse warping strategy to construct the training dataset on easily accessible monocular panoramic videos. Extensive experiments demonstrate that 360Explorer achieves superior performance in creating 4D controllable panoramic videos with camera transformation and object movements aligned with diverse provided instructions.

Downloads

Published

2026-03-14

How to Cite

Cheng, X., Zhou, H., Yu, W., Jia, T., Lin, B., Ge, Y., … Yuan, L. (2026). 360Explorer: Exploring 4D Controllable World in Panoramic Videos. Proceedings of the AAAI Conference on Artificial Intelligence, 40(5), 3300–3308. https://doi.org/10.1609/aaai.v40i5.37325

Issue

Section

AAAI Technical Track on Computer Vision II