QRShield: Exploiting Vulnerabilities of Latent Diffusion Models for Preventing AI Art Plagiarism

Xunyue Mo; Weibin Wu; Qingrui Tu; Hang Wang; Junxi He; Zibin Zheng

doi:10.1609/aaai.v40i10.37756

Authors

Xunyue Mo School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University
Weibin Wu School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University
Qingrui Tu School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University
Hang Wang School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University
Junxi He School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University
Zibin Zheng School of Software Engineering, Zhuhai Key Laboratory of Trusted Large Language Models, Sun Yat-sen University

DOI:

https://doi.org/10.1609/aaai.v40i10.37756

Abstract

Latent Diffusion Models (LDMs) have achieved remarkable success in image generation tasks, yet their low barrier to customization poses severe threats related to art plagiarism. As a countermeasure, adversarial methods have been proposed to protect artworks from plagiarism. However, current methods suffer from limited effectiveness, high cost, and complex optimization. Moreover, their exploration and exploitation of LDM vulnerabilities remain limited, restricting effectiveness and applicability. To address this issue, we analyze the VAE and U-Net components of LDMs, revealing their vulnerabilities. Specifically, we study the response of U-Net to specific structural and frequency patterns in the latent space and find that it is susceptible to high-frequency and periodic latent features. Furthermore, we observe channel correlations during the VAE encoding process. Inspired by these, we propose QRShield, an efficient protection method that exploits the vulnerabilities of LDMs. By constructing high-frequency and periodic features consistent across latent channels and combining them with a momentum-based translation-invariant attack strategy, QRShield achieves stronger and more efficient protection. QRShield significantly improves protection performance in various fine-tuning settings, with over 10% gains in multiple metrics, a threefold increase in generation speed, and nearly 50% reduction in memory usage. Therefore, our work offers a more practical method to prevent AI art plagiarism.

QRShield: Exploiting Vulnerabilities of Latent Diffusion Models for Preventing AI Art Plagiarism

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information