Beyond Single-Point Perturbation: A Hierarchical, Manifold-Aware Approach to Diffusion Attacks

Authors

  • Zhijie Wang Zhejiang University
  • Lin Wang Hangzhou Dianzi University
  • Zhenyu Wen Zhejiang University of Technology
  • Cong Wang Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v40i12.38013

Abstract

Latent Diffusion Models have become a powerful tool for generating high-fidelity unrestricted adversarial examples. However, the existing methods typically perturb only the initial latent or rely on prompt engineering, which is ill-suited to the iterative nature of the diffusion process, plus optimization instability due to external text prompts and cumulative drift that push the adversarial images off the data manifold. In this paper, we propose a hierarchical attack framework that operates in alignment with the model's generative manifold and leverages intermediate denoising states to maximize attack transferability and visual fidelity. Extensive experiments show that the proposed attack improves adversarial transferability by 10-20% against a diverse set of normally-trained models and achieves over 10.5% higher success rate against adversarially-defended models, while simultaneously enhancing visual quality by 1.0-1.2 FID reduction and 16.7% LPIPS improvements.

Downloads

Published

2026-03-14

How to Cite

Wang, Z., Wang, L., Wen, Z., & Wang, C. (2026). Beyond Single-Point Perturbation: A Hierarchical, Manifold-Aware Approach to Diffusion Attacks. Proceedings of the AAAI Conference on Artificial Intelligence, 40(12), 10421–10429. https://doi.org/10.1609/aaai.v40i12.38013

Issue

Section

AAAI Technical Track on Computer Vision IX