Wang, H. (2025) “Efficient and Robust Reinforcement Learning from Human Feedback”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(27), p. 28730. doi: 10.1609/aaai.v39i27.35123.