Long, A., Blair, A., & van Hoof, H. (2022). Fast and Data Efficient Reinforcement Learning from Pixels via Non-parametric Value Approximation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 7620–7627. https://doi.org/10.1609/aaai.v36i7.20728