Huang, Rongjie, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, et al. “AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 21 (March 24, 2024): 23802–23804. Accessed May 23, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/30570.