Chen, C., Y. Hu, Q. Zhang, H. Zou, B. Zhu, and E. S. Chng. “Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 11, June 2023, pp. 12607-15, doi:10.1609/aaai.v37i11.26484.