Yin, Han, Yafeng Chen, Chong Deng, Luyao Cheng, Hui Wang, Chao-Hong Tan, Qian Chen, Wen Wang, and Xiangang Li. 2026. “SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition With Multimodal Large Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (40): 34467–75. https://doi.org/10.1609/aaai.v40i40.40745.