1.
Zhang C, Li Y, Li J. Policy Search by Target Distribution Learning for Continuous Control. AAAI [Internet]. 2020Apr.3 [cited 2022Aug.14];34(04):6770-7. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/6156