Hahm, D., Min, T., Jin, W., & Lee, K. (2026). Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 37443–37451. https://doi.org/10.1609/aaai.v40i44.41077