WorldGrow: Generating Infinite 3D World

Authors

  • Sikuang Li MoE Key Lab of Artificial Intelligence, School of Computer Science, Shanghai Jiao Tong University
  • Chen Yang Huawei Inc.
  • Jiemin Fang Huawei Inc.
  • Taoran Yi Huazhong University of Science and Technology
  • Jia Lu Huazhong University of Science and Technology
  • Jiazhong Cen MoE Key Lab of Artificial Intelligence, School of Computer Science, Shanghai Jiao Tong University
  • Lingxi Xie Huawei Inc.
  • Wei Shen MoE Key Lab of Artificial Intelligence, School of Computer Science, Shanghai Jiao Tong University
  • Qi Tian Huawei Inc.

DOI:

https://doi.org/10.1609/aaai.v40i8.37571

Abstract

We tackle the challenge of generating the infinitely extendable 3D world -- large, continuous environments with coherent geometry and realistic appearance. Existing methods face key challenges: 2D-lifting approaches suffer from geometric and appearance inconsistencies across views, 3D implicit representations are hard to scale up, and current 3D foundation models are mostly object-centric, limiting their applicability to scene-level generation. Our key insight is leveraging strong generation priors from pre-trained 3D models for structured scene block generation. To this end, we propose WorldGrow, a hierarchical framework for unbounded 3D scene synthesis. Our method features three core components: (1) a data curation pipeline that extracts high-quality scene blocks for training, making the 3D structured latent representations suitable for scene generation; (2) a 3D block inpainting mechanism that enables context-aware scene extension; and (3) a coarse-to-fine generation strategy that ensures both global layout plausibility and local geometric/textural fidelity. Evaluated on the large-scale 3D-FRONT dataset, WorldGrow achieves SOTA performance in geometry reconstruction, while uniquely supporting infinite scene generation with photorealistic and structurally consistent outputs. These results highlight its capability for constructing large-scale virtual environments and potential for building future world models.

Published

2026-03-14

How to Cite

Li, S., Yang, C., Fang, J., Yi, T., Lu, J., Cen, J., … Tian, Q. (2026). WorldGrow: Generating Infinite 3D World. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6433–6441. https://doi.org/10.1609/aaai.v40i8.37571

Issue

Section

AAAI Technical Track on Computer Vision V