[1]
N. Prakash, Y. W. Jie, A. Abdullah, R. Satapathy, E. Cambria, and R. K.-W. Lee, “Beyond I’m Sorry, I Can’t: Dissecting Large-Language-Model Refusal”, AAAI, vol. 40, no. 44, pp. 37830–37838, Mar. 2026.