LI, Yehao; PAN, Yingwei; YAO, Ting; CHEN, Jingwen; MEI, Tao. Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 10, p. 8518–8526, 2021. DOI: 10.1609/aaai.v35i10.17034. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/17034. Acesso em: 29 may. 2026.