OMURA, M.; OSA, T.; MUKUTA, Y.; HARADA, T. Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 38, n. 13, p. 14474-14481, 2024. DOI: 10.1609/aaai.v38i13.29362. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/29362. Acesso em: 1 sep. 2024.