Lobel, Samuel, Sreehari Rammohan, Bowen He, Shangqun Yu, and George Konidaris. “Q-Functionals for Value-Based Continuous Control.” Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 7 (June 26, 2023): 8932–8939. Accessed April 23, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/26073.