(1)
Li, Y.; Pan, Y.; Yao, T.; Chen, J.; Mei, T. Scheduled Sampling in Vision-Language Pretraining With Decoupled Encoder-Decoder Network. AAAI 2021, 35, 8518-8526.