ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation

Authors

  • Siyuan Bian Department of Computer Science and Engineering, Shanghai Jiao Tong University
  • Jiefeng Li Department of Computer Science and Engineering, Shanghai Jiao Tong University
  • Jiasheng Tang DAMO Academy, Alibaba group Hupan Lab
  • Cewu Lu Department of Computer Science and Engineering, Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v38i2.27841

Keywords:

CV: Motion & Tracking, CV: 3D Computer Vision

Abstract

Accurate human shape recovery from a monocular RGB image is a challenging task because humans come in different shapes and sizes and wear different clothes. In this paper, we propose ShapeBoost, a new human shape recovery framework that achieves pixel-level alignment even for rare body shapes and high accuracy for people wearing different types of clothes. Unlike previous approaches that rely on the use of PCA-based shape coefficients, we adopt a new human shape parameterization that decomposes the human shape into bone lengths and the mean width of each part slice. This part-based parameterization technique achieves a balance between flexibility and validity using a semi-analytical shape reconstruction algorithm. Based on this new parameterization, a clothing-preserving data augmentation module is proposed to generate realistic images with diverse body shapes and accurate annotations. Experimental results show that our method outperforms other state-of-the-art methods in diverse body shape situations as well as in varied clothing situations.

Published

2024-03-24

How to Cite

Bian, S., Li, J., Tang, J., & Lu, C. (2024). ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 828-836. https://doi.org/10.1609/aaai.v38i2.27841

Issue

Section

AAAI Technical Track on Computer Vision I