Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

Zilin Wang; Haolin Zhuang; Lu Li; Yinmin Zhang; Junjie Zhong; Jun Chen; Yu Yang; Boshi Tang; Zhiyong Wu

doi:10.1609/aaai.v38i1.27783

Authors

Zilin Wang Shenzhen International Graduate School, Tsinghua University
Haolin Zhuang Shenzhen International Graduate School, Tsinghua University
Lu Li Shenzhen International Graduate School, Tsinghua University
Yinmin Zhang The University of Sydney
Junjie Zhong Waseda University
Jun Chen Shenzhen International Graduate School, Tsinghua University
Yu Yang Shenzhen International Graduate School, Tsinghua University
Boshi Tang Shenzhen International Graduate School, Tsinghua University
Zhiyong Wu Shenzhen International Graduate School, Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v38i1.27783

Keywords:

APP: Other Applications, ML: Applications

Abstract

This paper presents an Exploratory 3D Dance generation framework, E3D2, designed to address the exploration capability deficiency in existing music-conditioned 3D dance generation models. Current models often generate monotonous and simplistic dance sequences that misalign with human preferences because they lack exploration capabilities.The E3D2 framework involves a reward model trained from automatically-ranked dance demonstrations, which then guides the reinforcement learning process. This approach encourages the agent to explore and generate high quality and diverse dance movement sequences. The soundness of the reward model is both theoretically and experimentally validated. Empirical experiments demonstrate the effectiveness of E3D2 on the AIST++ dataset.

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription