Omura, M., Osa, T., Mukuta, Y. and Harada, T. (2024) “Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), pp. 14474-14481. doi: 10.1609/aaai.v38i13.29362.