(1)
Yang, L.; Yang, J.; Ren, S. Contextual Bandits With Delayed Feedback and Semi-Supervised Learning (Student Abstract). AAAI 2021, 35, 15943-15944.