[1] Z. Zhang, Y. Gan, and X. Tan, “Robust Action Gap Increasing with Clipped Advantage Learning”, AAAI, vol. 36, no. 8, pp. 9145–9152, Jun. 2022.