1.
Ding Y, Jin M, Lavaei J. Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design. AAAI [Internet]. 2023Jun.26 [cited 2024Jul.28];37(6):7405-13. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/25901