(1) Cai, Q.; Pan, L.; Tang, P. Deterministic Value-Policy Gradients. AAAI 2020, 34, 3316-3323.