[1]
P. Zhang, “FUSE: Fine-Grained and Semantic-Aware Learning for Unified Image Understanding and Generation”, AAAI, vol. 40, no. 33, pp. 28355–28363, Mar. 2026.