1.
Chen C, Hu Y, Zhang Q, Zou H, Zhu B, Chng ES. Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. AAAI [Internet]. 2023Jun.26 [cited 2024Feb.24];37(11):12607-15. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26484