Fan, Yuchen, Yuzhong Hong, Qiushi Wang, Junwei Bao, Hongfei Jiang, and Yang Song. 2025. “Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (22):23859-67. https://doi.org/10.1609/aaai.v39i22.34558.