Zhang, X. (2026) “Safety Alignment of Large Language Models via Contrasting Safe and Harmful Distributions”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), pp. 34827–34835. doi: 10.1609/aaai.v40i41.40785.