Li, Yehao, Yingwei Pan, Ting Yao, Jingwen Chen, and Tao Mei. 2021. “Scheduled Sampling in Vision-Language Pretraining With Decoupled Encoder-Decoder Network”. Proceedings of the AAAI Conference on Artificial Intelligence 35 (10):8518-26. https://doi.org/10.1609/aaai.v35i10.17034.