Zhu, Qiushi, Jie Zhang, Yu Gu, Yuchen Hu, and Lirong Dai. 2024. “Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (17):19768-76. https://doi.org/10.1609/aaai.v38i17.29951.