MUTTI, M.; PRATISSOLI, L.; RESTELLI, M. Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 10, p. 9028-9036, 2021. DOI: 10.1609/aaai.v35i10.17091. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/17091. Accessed on: 23 Apr. 2024.