LI, Y.; PAN, Y.; YAO, T.; CHEN, J.; MEI, T. Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 10, p. 8518-8526, 2021. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/17034. Accessed: 30 Jun. 2022.