Weng, S.-E., Shuai, H.-H., & Cheng, W.-H. (2023). Zero-Shot Face-Based Voice Conversion: Bottleneck-Free Speech Disentanglement in the Real-World Scenario. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13718-13726. https://doi.org/10.1609/aaai.v37i11.26607