[1]
Zhang, C., Chong, D., Jiang, F., Tang, C., Gao, A., Tang, G. and Li, H. 2025. Aligning Language Models Using Follow-up Likelihood as Reward Signal. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 24 (Apr. 2025), 25832-25841. DOI:https://doi.org/10.1609/aaai.v39i24.34776.