HumanPro: Single-view 3D Clothed Human Reconstruction with Progressive Normal Guidance

Authors

  • Jianchi Sun Wuhan University
  • Fei Luo Wuhan University
  • Wenzhuo Fan Wuhan University
  • Yu Jiang Wuhan University
  • Chunxia Xiao Wuhan University

DOI:

https://doi.org/10.1609/aaai.v40i11.37875

Abstract

Reconstructing fine-grained geometry of clothed human from single-view image is a challenging task, particularly in accurately recovering complex shapes and generating clothes details. To address these limitations, we propose a novel approach named HumanPro, which estimates high-quality human normals via a generative model, and progressively deforms a parametric body into the final clothed human mesh guided by normals. First, we propose a geometry-aware latent diffusion model with a normal enhancer to estimate high-quality human normals from four views. Then, we propose a progressive mesh optimization consisting of shape-aware deformation alignment and global-to-patch detail refinement for human mesh reconstruction. The shape-aware deformation alignment applies image morphing to learn the shape-level gap of normals, addressing large-scale deformation of complex clothes. It can recover the overall silhouette of a clothed human, and serves as an initialization for the global-to-patch detail refinement. Our detail refinement combines global and patch-wise optimization strategies to iteratively produce the clothed human mesh by minimizing the pixel-level difference of normals. This way effectively recovers fine-grained details while avoiding local minima. Extensive experiments demonstrate that HumanPro can deal with various challenging scenarios and outperforms state-of-the-art methods.

Published

2026-03-14

How to Cite

Sun, J., Luo, F., Fan, W., Jiang, Y., & Xiao, C. (2026). HumanPro: Single-view 3D Clothed Human Reconstruction with Progressive Normal Guidance. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 9180–9188. https://doi.org/10.1609/aaai.v40i11.37875

Issue

Section

AAAI Technical Track on Computer Vision VIII