[1] Omura, M., Osa, T., Mukuta, Y. and Harada, T. 2024. Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 13 (Mar. 2024), 14474-14481. DOI:https://doi.org/10.1609/aaai.v38i13.29362.