BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards
DOI:
https://doi.org/10.1609/aaai.v40i42.40914Abstract
Short-video platforms now host vast multimodal ads whose deceptive visuals, speech and subtitles demand finer-grained, policy-driven moderation than community safety filters. We present BLM-Guard, a content-audit framework for commercial ads that fuses Chain-of-Thought reasoning with rule-based policy principles and a critic-guided reward. A rule-driven ICoT data-synthesis pipeline jump-starts training by generating structured scene descriptions, reasoning chains and labels, cutting annotation costs. Reinforcement learning then refines the model using a composite reward balancing causal coherence with policy adherence. A multitask architecture models intra-modal manipulations (e.g., exaggerated imagery) and cross-modal mismatches (e.g., subtitle–speech drift), boosting robustness. Experiments on real short-video ads show BLM-Guard surpasses strong baselines in accuracy, consistency and generalization.Published
2026-03-14
How to Cite
Yang, Y., Liu, Z., Yuan, Y., Song, Y., Ma, X., Song, Y., … Zhang, J. (2026). BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards. Proceedings of the AAAI Conference on Artificial Intelligence, 40(42), 35985–35993. https://doi.org/10.1609/aaai.v40i42.40914
Issue
Section
AAAI Technical Track on Philosophy and Ethics of AI