Deng, R., Feng, D., & Lei, W. (2026). AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 37341–37349. https://doi.org/10.1609/aaai.v40i44.41066