Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection
DOI:
https://doi.org/10.1609/aaai.v40i18.38592Abstract
Despite substantial progress in anomaly synthesis, existing diffusion-based and coarse inpainting pipelines commonly suffer from structural deficiencies such as micro-structural discontinuities, limited semantic controllability, and inefficient generation. To overcome these limitations, we introduce ARAS, a language-conditioned, auto-regressive anomaly synthesis approach that precisely injects local, text-specified defects into normal images via token-anchored latent editing. Leveraging a hard-gated auto-regressive operator and a training-free, context-preserving masked sampling kernel, ARAS significantly enhances defect realism, preserves fine-grained material textures, and provides continuous semantic control over synthesized anomalies. Integrated within our Quality-Aware Re-weighted Anomaly Detection (QARAD) framework, we propose a dynamic weighting strategy that emphasizes high-quality synthetic samples by computing an image-text similarity score with a dual-encoder model. Extensive experiments across three datasets, MVTec AD, VisA, and BTAD, demonstrate that our QARAD outperforms SOTA methods in both image- and pixel-level anomaly detection tasks, achieving improved accuracy, robustness, and a 5× synthesis speedup compared to diffusion-based alternatives.Downloads
Published
2026-03-14
How to Cite
Qian, L., Zhu, B., Chen, Y., Tang, M., & Wang, J. (2026). Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15626–15634. https://doi.org/10.1609/aaai.v40i18.38592
Issue
Section
AAAI Technical Track on Data Mining & Knowledge Management II