Novel Motion Patterns Matter for Practical Skeleton-Based Action Recognition

Authors

  • Mengyuan Liu Key Laboratory of Machine Perception, Peking University, Shenzhen Graduate School
  • Fanyang Meng Peng Cheng Laboratory
  • Chen Chen University of Central Florida
  • Songtao Wu Sony R&D Center China

DOI:

https://doi.org/10.1609/aaai.v37i2.25258

Keywords:

CV: 3D Computer Vision, CV: Video Understanding & Activity Analysis

Abstract

Most skeleton-based action recognition methods assume that the same type of action samples in the training set and the test set share similar motion patterns. However, action samples in real scenarios usually contain novel motion patterns which are not involved in the training set. As it is laborious to collect sufficient training samples to enumerate various types of novel motion patterns, this paper presents a practical skeleton-based action recognition task where the training set contains common motion patterns of action samples and the test set contains action samples that suffer from novel motion patterns. For this task, we present a Mask Graph Convolutional Network (Mask-GCN) to focus on learning action-specific skeleton joints that mainly convey action information meanwhile masking action-agnostic skeleton joints that convey rare action information and suffer more from novel motion patterns. Specifically, we design a policy network to learn layer-wise body masks to construct masked adjacency matrices, which guide a GCN-based backbone to learn stable yet informative action features from dynamic graph structure. Extensive experiments on our newly collected dataset verify that Mask-GCN outperforms most GCN-based methods when testing with various novel motion patterns.

Downloads

Published

2023-06-26

How to Cite

Liu, M., Meng, F., Chen, C., & Wu, S. (2023). Novel Motion Patterns Matter for Practical Skeleton-Based Action Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 1701-1709. https://doi.org/10.1609/aaai.v37i2.25258

Issue

Section

AAAI Technical Track on Computer Vision II