(1)
Cao, Z.; Yang, Y.; Zhao, H. SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering. AAAI 2025, 39, 23523-23531.