Lupu, A., A. Durand, and D. Precup. “Leveraging Observations in Bandits: Between Risks and Benefits”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 6112-6119, doi:10.1609/aaai.v33i01.33016112.