[1] J. Spravil, S. Houben, and S. Behnke, “Scaling Laws for Conditional Emergence of Multilingual Image Captioning via Generalization from Translation”, AAAI, vol. 40, no. 30, pp. 25599–25607, Mar. 2026.