[1]

Ding, Y., Jin, M. and Lavaei, J. 2023. Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 6 (Jun. 2023), 7405-7413. DOI:https://doi.org/10.1609/aaai.v37i6.25901.