Yu, Yonghui, et al. “End-to-End Multi-Person Pose Estimation With Pose-Aware Video Transformer”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 14, Mar. 2026, pp. 12196-03, doi:10.1609/aaai.v40i14.38210.