Zhang, Xiao-Yu, Haichao Shi, Changsheng Li, and Peng Li. “Multi-Instance Multi-Label Action Recognition and Localization Based on Spatio-Temporal Pre-Trimming for Untrimmed Videos”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 12886-12893. Accessed May 7, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/6986.