(1)

Jiang, H.; Xie, J.; Yang, J. Action Candidate Based Clipped Double Q-Learning for Discrete and Continuous Action Tasks. AAAI 2021, 35, 7979-7986.