Yang, Guang, Manling Li, Jiajie Zhang, Xudong Lin, Heng Ji, and Shih-Fu Chang. “Video Event Extraction via Tracking Visual States of Arguments”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 3 (June 26, 2023): 3136–3144. Accessed May 28, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/25418.