Gao, Lishuai, Jun-Yan He, Yingsen Zeng, Yujie Zhong, Xiaopeng Sun, Jie Hu, Zan Gao, and Xiaoming Wei. “ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal Diffusion”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 6 (March 14, 2026): 4131–4139. Accessed May 11, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/42408.