(1) Song, F.; Yu, B.; Li, M.; Yu, H.; Huang, F.; Li, Y.; Wang, H. Preference Ranking Optimization for Human Alignment. AAAI 2024, 38, 18990-18998.