[1]
W. Wei, Y. Zhang, J. Liang, L. Li, and Y. Li, “Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation”, AAAI, vol. 36, no. 8, pp. 8621-8628, Jun. 2022.