Ding, Y., Jin, M., & Lavaei, J. (2023). Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 7405-7413. https://doi.org/10.1609/aaai.v37i6.25901