(1)

Zhang, Z.; Gan, Y.; Tan, X. Robust Action Gap Increasing With Clipped Advantage Learning. AAAI 2022, 36, 9145-9152.