(1)
Son, T.; Seo, S. W.; Kim, J.; Lee, S. H.; Choi, J. W. JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts. AAAI 2025, 39, 6940-6949.