[1] G. Piras, R. Mura, F. Brau, L. Oneto, F. Roli, and B. Biggio, “SOM Directions Are Better than One: Multi-Directional Refusal Suppression in Language Models”, AAAI, vol. 40, no. 39, pp. 32728–32736, Mar. 2026.