[1] M. Wang, “A Multimodal, Multi-Task Adapting Framework for Video Action Recognition”, AAAI, vol. 38, no. 6, pp. 5517–5525, Mar. 2024.