Liu, Y., Xu, L., Xiong, P., & Jin, Q. (2023). Token Mixing: Parameter-Efficient Transfer Learning from Image-Language to Video-Language. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 1781-1789. https://doi.org/10.1609/aaai.v37i2.25267