Wang, Guansu, and Peijie Sun. “Speech Recognition Model Improves Text-to-Speech Synthesis Using Fine-Grained Reward”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 39 (March 14, 2026): 33440–33448. Accessed May 12, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40631.