BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards

Yiran Yang; Zhaowei Liu; Yuan Yuan; Yukun Song; Xiong Ma; Yinghao Song; Xiangji Zeng; Lu Sun; Yulu Wang; Hai Zhou; Shuai Cui; Zhaohan Gong; Jiefei Zhang

doi:10.1609/aaai.v40i42.40914

Authors

Yiran Yang (Kuaishou Technology; Beijing University of Posts and Telecommunications)
Zhaowei Liu (Kuaishou Technology)
Yuan Yuan (Kuaishou Technology)
Yukun Song (Kuaishou Technology; Beijing University of Posts and Telecommunications)
Xiong Ma (Kuaishou Technology)
Yinghao Song (Kuaishou Technology)
Xiangji Zeng (Kuaishou Technology)
Lu Sun (Kuaishou Technology)
Yulu Wang (Kuaishou Technology)
Hai Zhou (Kuaishou Technology)
Shuai Cui (Kuaishou Technology; Shandong University)
Zhaohan Gong (Kuaishou Technology)
Jiefei Zhang (Kuaishou Technology)

DOI: https://doi.org/10.1609/aaai.v40i42.40914

Abstract

Short-video platforms now host vast multimodal ads whose deceptive visuals, speech and subtitles demand finer-grained, policy-driven moderation than community safety filters. We present BLM-Guard, a content-audit framework for commercial ads that fuses Chain-of-Thought reasoning with rule-based policy principles and a critic-guided reward. A rule-driven ICoT data-synthesis pipeline jump-starts training by generating structured scene descriptions, reasoning chains and labels, cutting annotation costs. Reinforcement learning then refines the model using a composite reward balancing causal coherence with policy adherence. A multitask architecture models intra-modal manipulations (e.g., exaggerated imagery) and cross-modal mismatches (e.g., subtitle–speech drift), boosting robustness. Experiments on real short-video ads show BLM-Guard surpasses strong baselines in accuracy, consistency and generalization.
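The abstract describes a composite reward that balances causal coherence with policy adherence but does not give its form. As a minimal sketch only, one common way to combine two such signals is a weighted sum. The function name `composite_reward`, the `policy_weight` parameter, and the assumption that both scores lie in [0, 1] are hypothetical and are not taken from the paper.

```python
def composite_reward(coherence: float, policy: float, policy_weight: float = 0.5) -> float:
    """Blend two scores into one scalar reward (illustrative assumption only).

    coherence:     hypothetical score in [0, 1] for how well the reasoning chain holds together
    policy:        hypothetical score in [0, 1] for agreement with the moderation policy
    policy_weight: hypothetical trade-off in [0, 1]; higher values favor policy adherence
    """
    inputs = (("coherence", coherence), ("policy", policy), ("policy_weight", policy_weight))
    for name, value in inputs:
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")
    # Convex combination: the result stays in [0, 1] whenever both scores do.
    return (1.0 - policy_weight) * coherence + policy_weight * policy


if __name__ == "__main__":
    # A fully coherent but policy-violating output, with policy weighted at 0.25.
    print(composite_reward(coherence=1.0, policy=0.0, policy_weight=0.25))
```

A convex combination keeps the reward bounded and makes the trade-off explicit through a single parameter. The reward actually used in BLM-Guard may differ in both its terms and how they are combined.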
