Sun, L., Zhang, K., Li, Q., & Lou, R. (2024). UMIE: Unified Multimodal Information Extraction with Instruction Tuning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19062–19070. https://doi.org/10.1609/aaai.v38i17.29873