[1]
M. Vatsa, A. Bharati, and R. Singh, “Right Looks, Wrong Reasons: Compositional Fidelity in Text-to-Image Generation”, AAAI, vol. 40, no. 46, pp. 39797–39805, Mar. 2026.