Wang, Chenglong, Hang Zhou, Yimin Hu, Yifu Huo, Bei Li, Tongran Liu, Tong Xiao, and Jingbo Zhu. 2024. “ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (17):19107-15. https://doi.org/10.1609/aaai.v38i17.29878.