Wang, Chenglong, Yifu Huo, Yang Gan, Yongyu Mu, Qiaozhi He, Murun Yang, Bei Li, et al. “Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 39 (March 14, 2026): 33404–33412. Accessed May 16, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40627.