DECON: Reconstruction of Clothed-Geometric Multiple Humans from a Single Image via Geometry-Guided Decoupling

Authors

  • Yiming Jiang State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, China
  • Wenfeng Song Computer School, Beijing Information Science and Technology University, China
  • Shuai Li State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, China Zhongguancun Laboratory, China
  • Aimin Hao State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, China Research Unit of Virtual Human and Virtual Surgery (2019RU004), Chinese Academy of Medical Sciences

DOI:

https://doi.org/10.1609/aaai.v40i7.37463

Abstract

3D multi-human reconstruction from single images holds significant potential for advancing AR/VR applications. While remarkable progress has been made in single-human reconstruction, existing methods face challenges when reconstructing multiple humans. These challenges include: (1) severe inter-occlusion that disrupts individual body structures, and (2) the absence of physically plausible relative positioning among subjects. We present DECON, a novel DEcouple-and-reCONstruct framework that systematically addresses these limitations through two technical innovations: (1) a decouple-and-reconstruct framework with multi-view synthesis. It separates individuals and reconstructs detailed 3D bodies from a single image. (2) a Perspective-Aware Position Optimization (PAPO) approach. It ensures realistic positioning by fixing overlaps and gaps between subjects. Extensive experiments demonstrate our method's capability to reconstruct fully separated, anatomically complete 3D humans with clothed-geometric details and plausible interactions. Quantitative evaluations show a 54% reduction in Chamfer Distance and 35% in Point-to-Surface Distance compared to state-of-the-art methods.

Downloads

Published

2026-03-14

How to Cite

Jiang, Y., Song, W., Li, S., & Hao, A. (2026). DECON: Reconstruction of Clothed-Geometric Multiple Humans from a Single Image via Geometry-Guided Decoupling. Proceedings of the AAAI Conference on Artificial Intelligence, 40(7), 5459–5467. https://doi.org/10.1609/aaai.v40i7.37463

Issue

Section

AAAI Technical Track on Computer Vision IV