Kuang, Yufei, Miao Lu, Jie Wang, Qi Zhou, Bin Li, and Houqiang Li. 2022. “Learning Robust Policy Against Disturbance in Transition Dynamics via State-Conservative Policy Optimization”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (7):7247-54. https://doi.org/10.1609/aaai.v36i7.20686.