Mutti, M., L. Pratissoli, and M. Restelli. “Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 10, May 2021, pp. 9028-36, https://ojs.aaai.org/index.php/AAAI/article/view/17091.