Yu, Y., Cai, J., Wang, X., & Yang, W. (2026). End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer. Proceedings of the AAAI Conference on Artificial Intelligence, 40(14), 12196–12203. https://doi.org/10.1609/aaai.v40i14.38210