DING, Y.; JIN, M.; LAVAEI, J. Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 37, n. 6, p. 7405-7413, 2023. DOI: 10.1609/aaai.v37i6.25901. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/25901. Acesso em: 28 jul. 2024.