Chen, C., Hu, Y., Zhang, Q., Zou, H., Zhu, B. and Chng, E. S. (2023) “Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), pp. 12607-12615. doi: 10.1609/aaai.v37i11.26484.