[1]
Z. Zhang, Y. Gan, and X. Tan, “Robust Action Gap Increasing with Clipped Advantage Learning”, AAAI, vol. 36, no. 8, pp. 9145-9152, Jun. 2022.