Generating Attribute-Aware Human Motions from Textual Prompt

Authors

  • Xinghan Wang Peking University, China
  • Kun Xu Peking University, China
  • Fei Li China Tower Corporation Limited
  • Cao Sheng China Tower Corporation Limited
  • JiaZhong Yu China Tower Corporation Limited
  • Yadong Mu Peking University, China

DOI:

https://doi.org/10.1609/aaai.v40i12.37990

Abstract

Text-driven human motion generation has recently attracted considerable attention, allowing models to generate human motions based on textual descriptions. However, current methods neglect the influence of human attributes—such as age, gender, weight, and height—which are key factors shaping human motion patterns. This work represents a pilot exploration for bridging this gap. We conceptualize each motion as comprising both attribute information and action semantics, where textual descriptions align exclusively with action semantics. To achieve this, a new framework inspired by Structural Causal Models is proposed to decouple action semantics from human attributes, enabling text-to-semantics prediction and attribute-controlled generation. The resulting model is capable of generating attribute-aware motion aligned with the user's text and attribute inputs. For evaluation, we introduce a comprehensive dataset containing attribute annotations for text-motion pairs, setting the first benchmark for attribute-aware motion generation. Extensive experiments validate our model's effectiveness.

Downloads

Published

2026-03-14

How to Cite

Wang, X., Xu, K., Li, F., Sheng, C., Yu, J., & Mu, Y. (2026). Generating Attribute-Aware Human Motions from Textual Prompt. Proceedings of the AAAI Conference on Artificial Intelligence, 40(12), 10216–10224. https://doi.org/10.1609/aaai.v40i12.37990

Issue

Section

AAAI Technical Track on Computer Vision IX