(1) Wang, C.; Huo, Y.; Gan, Y.; Mu, Y.; He, Q.; Yang, M.; Li, B.; Zhang, C.; Liu, T.; Ma, A. Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models. AAAI 2026, 40, 33404-33412.