[1]
C. Wang, “Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models”, AAAI, vol. 40, no. 39, pp. 33404–33412, Mar. 2026.