Long, Alexander, Alan Blair, and Herke van Hoof. “Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 7 (June 28, 2022): 7620–7627. Accessed May 13, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/20728.