Tang, Yunlong, Daiki Shimada, Jing Bi, Mingqian Feng, Hang Hua, and Chenliang Xu. 2025. “Empowering LLMs With Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (7):7293-7301. https://doi.org/10.1609/aaai.v39i7.32784.