Beyond Single-Point Perturbation: A Hierarchical, Manifold-Aware Approach to Diffusion Attacks
DOI:
https://doi.org/10.1609/aaai.v40i12.38013Abstract
Latent Diffusion Models have become a powerful tool for generating high-fidelity unrestricted adversarial examples. However, the existing methods typically perturb only the initial latent or rely on prompt engineering, which is ill-suited to the iterative nature of the diffusion process, plus optimization instability due to external text prompts and cumulative drift that push the adversarial images off the data manifold. In this paper, we propose a hierarchical attack framework that operates in alignment with the model's generative manifold and leverages intermediate denoising states to maximize attack transferability and visual fidelity. Extensive experiments show that the proposed attack improves adversarial transferability by 10-20% against a diverse set of normally-trained models and achieves over 10.5% higher success rate against adversarially-defended models, while simultaneously enhancing visual quality by 1.0-1.2 FID reduction and 16.7% LPIPS improvements.Published
2026-03-14
How to Cite
Wang, Z., Wang, L., Wen, Z., & Wang, C. (2026). Beyond Single-Point Perturbation: A Hierarchical, Manifold-Aware Approach to Diffusion Attacks. Proceedings of the AAAI Conference on Artificial Intelligence, 40(12), 10421–10429. https://doi.org/10.1609/aaai.v40i12.38013
Issue
Section
AAAI Technical Track on Computer Vision IX