(1) Fan, Y.; Hong, Y.; Wang, Q.; Bao, J.; Jiang, H.; Song, Y. Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models. AAAI 2025, 39, 23859–23867.