BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards

Authors

  • Yiran Yang Kuaishou Technology Beijing University of Posts and Telecommunications
  • Zhaowei Liu Kuaishou Technology
  • Yuan Yuan Kuaishou Technology
  • Yukun Song Kuaishou Technology Beijing University of Posts and Telecommunications
  • Xiong Ma Kuaishou Technology
  • Yinghao Song Kuaishou Technology
  • Xiangji Zeng Kuaishou Technology
  • Lu Sun Kuaishou Technology
  • Yulu Wang Kuaishou Technology
  • Hai Zhou Kuaishou Technology
  • Shuai Cui Kuaishou Technology Shandong University
  • Zhaohan Gong Kuaishou Technology
  • Jiefei Zhang Kuaishou Technology

DOI:

https://doi.org/10.1609/aaai.v40i42.40914

Abstract

Short-video platforms now host vast multimodal ads whose deceptive visuals, speech and subtitles demand finer-grained, policy-driven moderation than community safety filters. We present BLM-Guard, a content-audit framework for commercial ads that fuses Chain-of-Thought reasoning with rule-based policy principles and a critic-guided reward. A rule-driven ICoT data-synthesis pipeline jump-starts training by generating structured scene descriptions, reasoning chains and labels, cutting annotation costs. Reinforcement learning then refines the model using a composite reward balancing causal coherence with policy adherence. A multitask architecture models intra-modal manipulations (e.g., exaggerated imagery) and cross-modal mismatches (e.g., subtitle–speech drift), boosting robustness. Experiments on real short-video ads show BLM-Guard surpasses strong baselines in accuracy, consistency and generalization.

Downloads

Published

2026-03-14

How to Cite

Yang, Y., Liu, Z., Yuan, Y., Song, Y., Ma, X., Song, Y., … Zhang, J. (2026). BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards. Proceedings of the AAAI Conference on Artificial Intelligence, 40(42), 35985–35993. https://doi.org/10.1609/aaai.v40i42.40914

Issue

Section

AAAI Technical Track on Philosophy and Ethics of AI