PBR3DGen: A VLM-Guided Mesh Generation with High-Quality PBR Texture

Authors

  • Xiaokang Wei The Hong Kong Polytechnic University Tencent Hunyuan3D
  • Bowen Zhang Tencent Hunyuan3D
  • Xianghui Yang Tencent Hunyuan3D
  • Yuxuan Wang Tencent Hunyuan3D Nanyang Technological University
  • Chunchao Guo Tencent Hunyuan3D
  • Xi Zhao Xi'an Jiaotong University
  • Yan Luximon The Hong Kong Polytechnic University

DOI:

https://doi.org/10.1609/aaai.v40i13.38030

Abstract

Generating high-quality physically based rendering (PBR) materials is important to achieve realistic rendering in the downstream tasks, yet it remains challenging due to the intertwined effects of materials and lighting. While existing methods have made breakthroughs by incorporating material decomposition in the 3D generation pipeline, they tend to bake highlights into albedo and ignore spatially varying properties of metallicity and roughness. In this work, we present PBR3DGen, a two-stage mesh generation method with high-quality PBR materials that integrates the novel multi-view PBR material estimation model and a 3D PBR mesh reconstruction model. Specifically, PBR3DGen leverages vision language models (VLM) to guide multi-view diffusion, precisely capturing the spatial distribution and inherent attributes of reflective-metalness material. Additionally, we incorporate view-dependent illumination-aware conditions as pixel-aware priors to enhance spatially varying material properties. Furthermore, our reconstruction model reconstructs high-quality mesh with PBR materials. Experimental results demonstrate that PBR3DGen significantly outperforms existing methods, achieving new state-of-the-art results for PBR estimation and mesh generation.

Downloads

Published

2026-03-14

How to Cite

Wei, X., Zhang, B., Yang, X., Wang, Y., Guo, C., Zhao, X., & Luximon, Y. (2026). PBR3DGen: A VLM-Guided Mesh Generation with High-Quality PBR Texture. Proceedings of the AAAI Conference on Artificial Intelligence, 40(13), 10575-10583. https://doi.org/10.1609/aaai.v40i13.38030

Issue

Section

AAAI Technical Track on Computer Vision X