Huang, B., Tan, Z., Wang, H., Liu, Z., Li, D., Payani, A., Liu, H., Chen, T., & Shu, K. (2026). Model Editing as a Double-Edged Sword: Steering Agent Behavior Toward Beneficence or Harm. Proceedings of the AAAI Conference on Artificial Intelligence, 40(37), 31113-31121. https://doi.org/10.1609/aaai.v40i37.40372