[1]

Z. Cao, Y. Yang, and H. Zhao, “SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering”, AAAI, vol. 39, no. 22, pp. 23523–23531, Apr. 2025.