[1]
Y. Shi, J. Wang, Z. Shan, D. Peng, Z. Lin, and L. Jin, “URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding”, AAAI, vol. 40, no. 30, pp. 25357–25365, Mar. 2026.