Cai, Y., Yuan, Y., Shi, J., & Lin, Q. (2025). Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23505-23513. https://doi.org/10.1609/aaai.v39i22.34519