Wang, R., Chen, Z., Chen, C., Ma, J., Lu, H., & Lin, X. (2024). Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5544–5552. https://doi.org/10.1609/aaai.v38i6.28364