DABNEY, Will; BARRETO, André; ROWLAND, Mark; DADASHI, Robert; QUAN, John; BELLEMARE, Marc G.; SILVER, David. The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 8, p. 7160–7168, 2021. DOI: 10.1609/aaai.v35i8.16880. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/16880. Accessed: 28 May 2026.