[1]
G. Wang and P. Sun, “Speech Recognition Model Improves Text-to-Speech Synthesis Using Fine-Grained Reward”, AAAI, vol. 40, no. 39, pp. 33440–33448, Mar. 2026.