Yang, Yiran, et al. “BLM-Guard: Explainable Multimodal Ad Moderation With Chain-of-Thought and Policy-Aligned Rewards”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 42, Mar. 2026, pp. 35985-93, doi:10.1609/aaai.v40i42.40914.