Huang, R., Li, M., Yang, D., Shi, J., Chang, X., Ye, Z., Wu, Y., Hong, Z., Huang, J., Liu, J., Ren, Y., Zou, Y., Zhao, Z., & Watanabe, S. (2024). AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23802-23804. https://doi.org/10.1609/aaai.v38i21.30570