(1)
Prakash, N.; Jie, Y. W.; Abdullah, A.; Satapathy, R.; Cambria, E.; Lee, R. K.-W. Beyond I’m Sorry, I Can’t: Dissecting Large-Language-Model Refusal. AAAI 2026, 40, 37830-37838.