(1)

Li, S.; Zhang, C. An Optimal Online Method of Selecting Source Policies for Reinforcement Learning. AAAI 2018, 32.