Huang, R., Li, M., Yang, D., Shi, J., Chang, X., Ye, Z., … Watanabe, S. (2024). AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23802–23804. https://doi.org/10.1609/aaai.v38i21.30570