[1]

Y. Huang, T. Luo, H. Guo, and Y. Zhang, “Text-Guided Gradient Refinement: Resolving Multimodal Gradient Conflicts to Boost Adversarial Attacks on Vision-Language Models”, AAAI, vol. 40, no. 7, pp. 5212–5220, Mar. 2026.