AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer

Authors

  • Yulim So Sungkyunkwan University
  • Seokho Kang Sungkyunkwan University

DOI:

https://doi.org/10.1609/aaai.v40i18.38604

Abstract

Anomaly generation has been widely explored to address the scarcity of anomaly images in real-world data. However, existing methods typically suffer from at least one of the following limitations, hindering their practical deployment: (1) lack of visual realism in generated anomalies; (2) dependence on large amounts of real images; and (3) use of memory-intensive, heavyweight model architectures. To overcome these limitations, we propose AnoStyler, a lightweight yet effective method that frames zero-shot anomaly generation as text-guided style transfer. Given a single normal image along with its category label and expected defect type, an anomaly mask indicating the localized anomaly regions and two-class text prompts representing the normal and anomaly states are generated using generalizable category-agnostic procedures. A lightweight U-Net model trained with CLIP-based loss functions is used to stylize the normal image into a visually realistic anomaly image, where anomalies are localized by the anomaly mask and semantically aligned with the text prompts. Extensive experiments on the MVTec-AD and VisA datasets show that AnoStyler outperforms existing anomaly generation methods in generating high-quality and diverse anomaly images. Furthermore, using these generated anomalies helps enhance anomaly detection performance.

Downloads

Published

2026-03-14

How to Cite

So, Y., & Kang, S. (2026). AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15734–15742. https://doi.org/10.1609/aaai.v40i18.38604

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II