[1] D. Hahm, T. Min, W. Jin, and K. Lee, “Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation,” AAAI, vol. 40, no. 44, pp. 37443–37451, Mar. 2026.