Mutti M, Pratissoli L, Restelli M. Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate. AAAI [Internet]. 2021May18 [cited 2024Apr.25];35(10):9028-36. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/17091