[1]
G. Yang, M. Li, J. Zhang, X. Lin, H. Ji, and S.-F. Chang, “Video Event Extraction via Tracking Visual States of Arguments”, AAAI, vol. 37, no. 3, pp. 3136-3144, Jun. 2023.