Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach

Authors

  • Hangyu Liu Zhejiang University Westlake University
  • Bo Peng Beijing University of Posts and Telecommunications
  • Pengxiang Ding Zhejiang University Westlake University
  • Donglin Wang Westlake University

DOI:

https://doi.org/10.1609/aaai.v40i9.37651

Abstract

Compared to single-target adversarial attacks, multi-target attacks have garnered significant attention due to their ability to generate adversarial images for multiple target classes simultaneously. However, existing generative approaches for multi-target attacks primarily encode target labels into one-dimensional tensors, leading to a loss of fine-grained visual information and overfitting to model-specific features during noise generation. To address this gap, we first identify and validate that the semantic feature quality and quantity are critical factors affecting the transferability of targeted attacks: 1) Feature quality refers to the structural and detailed completeness of the implanted target features, as deficiencies may result in the loss of key discriminative information; 2) Feature quantity refers to the spatial sufficiency of the implanted target features, as inadequacy limits the victim model's attention to this feature. Based on these findings, we propose the 2D Tensor-Guided Adversarial Fusion (TGAF) framework, which leverages the powerful generative capabilities of diffusion models to encode target labels into two-dimensional semantic tensors for guiding adversarial noise generation. Additionally, we design a novel masking strategy tailored for the training process, ensuring that parts of the generated noise retain complete semantic information about the target class. Extensive experiments demonstrate that TGAF consistently surpasses state-of-the-art methods across various settings.

Downloads

Published

2026-03-14

How to Cite

Liu, H., Peng, B., Ding, P., & Wang, D. (2026). Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 7150–7158. https://doi.org/10.1609/aaai.v40i9.37651

Issue

Section

AAAI Technical Track on Computer Vision VI