QI, X.; YANG, Y.; CAO, J.; BAI, L.; FAN, C.; CAO, C.; WANG, H. Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 29, p. 24900-24908, 2026. DOI: 10.1609/aaai.v40i29.39677. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/39677. Acesso em: 3 may. 2026.