[1] T. Ren, H. Wang, and K. Rafferty, “Enhancing Question Generation through Diversity-Seeking Reinforcement Learning with Bilevel Policy Decomposition”, AAAI, vol. 39, no. 23, pp. 25083–25091, Apr. 2025.