1.
Hester T, Vecerik M, Pietquin O, Lanctot M, Schaul T, Piot B, Horgan D, Quan J, Sendonaris A, Osband I, Dulac-Arnold G, Agapiou J, Leibo J, Gruslys A. Deep Q-learning From Demonstrations. AAAI [Internet]. 2018Apr.29 [cited 2024Apr.19];32(1). Available from: https://ojs.aaai.org/index.php/AAAI/article/view/11757