(1)
Vieillard, N.; Pietquin, O.; Geist, M. Deep Conservative Policy Iteration. AAAI 2020, 34, 6070-6077.