Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction

Authors

  • Dong Wei Nanjing University of Science and Technology
  • Huaijiang Sun Nanjing University of Science and Technology
  • Bin Li Tianjin AiForward Science and Technology Co., Ltd.
  • Jianfeng Lu Nanjing University of Science and Technology
  • Weiqing Li Nanjing University of Science and Technology
  • Xiaoning Sun Nanjing University of Science and Technology
  • Shengxiang Hu Nanjing University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i5.25754

Keywords:

HAI: Human-Aware Planning and Behavior Prediction, CV: Motion & Tracking, ML: Deep Generative Models & Autoencoders

Abstract

Stochastic human motion prediction aims to forecast multiple plausible future motions given a single pose sequence from the past. Most previous works focus on designing elaborate losses to improve the accuracy, while the diversity is typically characterized by randomly sampling a set of latent variables from the latent prior, which is then decoded into possible motions. This joint training of sampling and decoding, however, suffers from posterior collapse as the learned latent variables tend to be ignored by a strong decoder, leading to limited diversity. Alternatively, inspired by the diffusion process in nonequilibrium thermodynamics, we propose MotionDiff, a diffusion probabilistic model to treat the kinematics of human joints as heated particles, which will diffuse from original states to a noise distribution. This process not only offers a natural way to obtain the "whitened'' latents without any trainable parameters, but also introduces a new noise in each diffusion step, both of which facilitate more diverse motions. Human motion prediction is then regarded as the reverse diffusion process that converts the noise distribution into realistic future motions conditioned on the observed sequence. Specifically, MotionDiff consists of two parts: a spatial-temporal transformer-based diffusion network to generate diverse yet plausible motions, and a flexible refinement network to further enable geometric losses and align with the ground truth. Experimental results on two datasets demonstrate that our model yields the competitive performance in terms of both diversity and accuracy.

Downloads

Published

2023-06-26

How to Cite

Wei, D., Sun, H., Li, B., Lu, J., Li, W., Sun, X., & Hu, S. (2023). Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, 37(5), 6110-6118. https://doi.org/10.1609/aaai.v37i5.25754

Issue

Section

AAAI Technical Track on Humans and AI