[1]
Y. Chen, Z. Ma, J. Wang, K. Kang, S. Yao, and W. Zhang, “LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer”, AAAI, vol. 40, no. 4, pp. 3174–3182, Mar. 2026.