Real3D: The Curious Case of Neural Scene Degeneration

Authors

  • Dengsheng Chen Meituan
  • Jie Hu Meituan
  • Xiaoming Wei Meituan
  • Enhua Wu State Key Laboratory of Computer Science, ISCAS University of Chinese Academy of Sciences University of Macau

DOI:

https://doi.org/10.1609/aaai.v38i2.27863

Keywords:

CV: 3D Computer Vision, CV: Computational Photography, Image & Video Synthesis, CV: Applications, General

Abstract

Despite significant progress in utilizing pre-trained text-to-image diffusion models to guide the creation of 3D scenes, these methods often struggle to generate scenes that are sufficiently realistic, leading to "neural scene degeneration". In this work, we propose a new 3D scene generation model called Real3D. Specifically, Real3D designs a pipeline from a NeRF-like implicit renderer to a tetrahedrons-based explicit renderer, greatly improving the neural network's ability to generate various neural scenes. Moreover, Real3D introduces an additional discriminator to prevent neural scenes from falling into undesirable local optima, thus avoiding the degeneration phenomenon. Our experimental results demonstrate that Real3D outperforms all existing state-of-the-art text-to-3D generation methods, providing valuable insights to facilitate the development of learning-based 3D scene generation approaches.

Published

2024-03-24

How to Cite

Chen, D., Hu, J., Wei, X., & Wu, E. (2024). Real3D: The Curious Case of Neural Scene Degeneration. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1028–1036. https://doi.org/10.1609/aaai.v38i2.27863

Issue

Section

AAAI Technical Track on Computer Vision I