[1]
Lupu, A., Durand, A. and Precup, D. 2019. Leveraging Observations in Bandits: Between Risks and Benefits. Proceedings of the AAAI Conference on Artificial Intelligence. 33, 01 (Jul. 2019), 6112-6119. DOI:https://doi.org/10.1609/aaai.v33i01.33016112.