1. Li S, Zhang C. An Optimal Online Method of Selecting Source Policies for Reinforcement Learning. AAAI [Internet]. 2018 Apr 29 [cited 2024 Mar 29];32(1). Available from: https://ojs.aaai.org/index.php/AAAI/article/view/11718