High-Quality Full-Head 3D Avatar Generation from Any Single Portrait Image

Authors

  • Yujie Gao Shanghai Jiao Tong University
  • ChenCheng Wang Shanghai Jiao Tong University
  • Xianbing Sun Shanghai Jiao Tong University
  • Jiahui Zhan Shanghai Jiao Tong University
  • Wentao Wang Shanghai AI Laboratory
  • Yiyi Zhang Shanghai Jiao Tong University
  • Haohua Zhao Shanghai Jiao Tong University
  • Liqing Zhang Shanghai Jiao Tong University
  • Jianfu Zhang Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v40i6.42417

Abstract

In this work, we introduce a novel high-fidelity full-head 3D avatar generation method from a single image, regardless of perspective, style, expression, or accessories. Prior works often fail to preserve consistent head geometry and facial details, primarily due to their limited capacity in modeling fine-grained facial textures and maintaining identity information. To address these challenges, we construct a new high-quality dataset containing 227 sequences of digital human portraits captured from 96 different perspectives, totalling 21,792 frames, featuring high-quality facial texture details. To further improve performance, we propose a novel multi-view diffusion named ID-TS diffusion model, which integrate identity and expression information into the two-stage multi-view diffusion process. The low-resolution stage ensures structural consistency of heads across multiple views, while the high-resolution stage preserves facial detail fidelity and coherence. Finally, we propose an enhanced feed-forward Gaussian avatar reconstruction method that optimizes the network on multi-view images of each single subject, significantly improving 3D facial texture details. Extensive experiments show that our method demonstrates robust performance across challenging scenarios, while showcasing broad applicability across numerous downstream tasks.

Downloads

Published

2026-03-14

How to Cite

Gao, Y., Wang, C., Sun, X., Zhan, J., Wang, W., Zhang, Y., … Zhang, J. (2026). High-Quality Full-Head 3D Avatar Generation from Any Single Portrait Image. Proceedings of the AAAI Conference on Artificial Intelligence, 40(6), 4212–4220. https://doi.org/10.1609/aaai.v40i6.42417

Issue

Section

AAAI Technical Track on Computer Vision III