[1]
X. Yi, S. Zheng, L. Wang, G. de Melo, X. Wang, and L. He, “NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning”, AAAI, vol. 39, no. 24, pp. 25706–25714, Apr. 2025.