Cheng, Yu, Bo Wang, Bo Yang, and Robby T. Tan. “Graph and Temporal Convolutional Networks for 3D Multi-Person Pose Estimation in Monocular Videos”. Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 2 (May 18, 2021): 1157–1165. Accessed May 31, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/16202.