1.
Wei J, Zhan H, Lu Y, Tu X, Yin B, Liu C, et al. Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network. AAAI [Internet]. 2024 Mar. 24 [cited 2026 May 27];38(6):5885-93. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/28402