1.
Wei W, Zhang Y, Liang J, Li L, Li Y. Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation. AAAI [Internet]. 2022Jun.28 [cited 2024Jul.25];36(8):8621-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/20840