[1] J. Feng, S. Wu, H. Sun, P. Zhang, B. Ren, and S. Zhang, “Stabilizing Cross-Modal Bidirectional Attribution: Few-Shot Adversarial Prompt Tuning for Robust Vision-Language Models,” AAAI, vol. 40, no. 5, pp. 3939–3947, Mar. 2026.