3D Single-Person Concurrent Activity Detection Using Stacked Relation Network

Authors

  • Yi Wei University at Albany, State University of New York
  • Wenbo Li Samsung Research America AI Center
  • Yanbo Fan Tencent AI Laboratory
  • Linghan Xu Tianjin University
  • Ming-Ching Chang University at Albany, State University of New York
  • Siwei Lyu University at Albany, State University of New York

DOI:

https://doi.org/10.1609/aaai.v34i07.6917

Abstract

We aim to detect real-world concurrent activities performed by a single person from a streaming 3D skeleton sequence. Different from most existing works that deal with concurrent activities performed by multiple persons that are seldom correlated, we focus on concurrent activities that are spatio-temporally or causally correlated and performed by a single person. For the sake of generalization, we propose an approach based on a decompositional design to learn a dedicated feature representation for each activity class. To address the scalability issue, we further extend the class-level decompositional design to the postural-primitive level, such that each class-wise representation does not need to be extracted by independent backbones, but through a dedicated weighted aggregation of a shared pool of postural primitives. There are multiple interdependent instances deriving from each decomposition. Thus, we propose Stacked Relation Networks (SRN), with a specialized relation network for each decomposition, so as to enhance the expressiveness of instance-wise representations via the inter-instance relationship modeling. SRN achieves state-of-the-art performance on a public dataset and a newly collected dataset. The relation weights within SRN are interpretable among the activity contexts. The new dataset and code are available at https://github.com/weiyi1991/UA_Concurrent/

Downloads

Published

2020-04-03

How to Cite

Wei, Y., Li, W., Fan, Y., Xu, L., Chang, M.-C., & Lyu, S. (2020). 3D Single-Person Concurrent Activity Detection Using Stacked Relation Network. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 12329-12337. https://doi.org/10.1609/aaai.v34i07.6917

Issue

Section

AAAI Technical Track: Vision