Li, S., & Zhang, C. (2018). An Optimal Online Method of Selecting Source Policies for Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11718