Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection

Authors

  • Long Qian Institute of Automation, Chinese Academy of Sciences School of Future Technology, University of Chinese Academy of Sciences
  • Bingke Zhu Institute of Automation, Chinese Academy of Sciences School of Future Technology, University of Chinese Academy of Sciences
  • Yingying Chen Institute of Automation, Chinese Academy of Sciences School of Future Technology, University of Chinese Academy of Sciences
  • Ming Tang Institute of Automation, Chinese Academy of Sciences School of Future Technology, University of Chinese Academy of Sciences
  • Jinqiao Wang Institute of Automation, Chinese Academy of Sciences School of Future Technology, University of Chinese Academy of Sciences Objecteye Inc.

DOI:

https://doi.org/10.1609/aaai.v40i18.38592

Abstract

Despite substantial progress in anomaly synthesis, existing diffusion-based and coarse inpainting pipelines commonly suffer from structural deficiencies such as micro-structural discontinuities, limited semantic controllability, and inefficient generation. To overcome these limitations, we introduce ARAS, a language-conditioned, auto-regressive anomaly synthesis approach that precisely injects local, text-specified defects into normal images via token-anchored latent editing. Leveraging a hard-gated auto-regressive operator and a training-free, context-preserving masked sampling kernel, ARAS significantly enhances defect realism, preserves fine-grained material textures, and provides continuous semantic control over synthesized anomalies. Integrated within our Quality-Aware Re-weighted Anomaly Detection (QARAD) framework, we propose a dynamic weighting strategy that emphasizes high-quality synthetic samples by computing an image-text similarity score with a dual-encoder model. Extensive experiments across three datasets, MVTec AD, VisA, and BTAD, demonstrate that our QARAD outperforms SOTA methods in both image- and pixel-level anomaly detection tasks, achieving improved accuracy, robustness, and a 5× synthesis speedup compared to diffusion-based alternatives.

Downloads

Published

2026-03-14

How to Cite

Qian, L., Zhu, B., Chen, Y., Tang, M., & Wang, J. (2026). Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15626–15634. https://doi.org/10.1609/aaai.v40i18.38592

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II