Omura, Motoki, Takayuki Osa, Yusuke Mukuta, and Tatsuya Harada. 2024. “Symmetric Q-Learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (13):14474-81. https://doi.org/10.1609/aaai.v38i13.29362.