Long, Alexander, Alan Blair, and Herke van Hoof. “Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 7 (June 28, 2022): 7620-7627. Accessed April 19, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20728.