Chen, Z. and Tan, V. Y. F. (2026) “On the Exponential Convergence for Offline RLHF with Pairwise Comparisons”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), pp. 37277–37285. doi: 10.1609/aaai.v40i44.41059.