Deng, Ruibo, Duanyu Feng, and Wenqiang Lei. 2026. “AMaPO: Adaptive Margin-Attached Preference Optimization for Language Model Alignment.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (44): 37341-49. https://doi.org/10.1609/aaai.v40i44.41066.