Long, Alexander, Alan Blair, and Herke van Hoof. 2022. “Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation.” Proceedings of the AAAI Conference on Artificial Intelligence 36 (7): 7620–27. https://doi.org/10.1609/aaai.v36i7.20728.