[1] R. Deng, D. Feng, and W. Lei, “AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment,” AAAI, vol. 40, no. 44, pp. 37341–37349, Mar. 2026.