[1] Q. Cai, L. Pan, and P. Tang, “Deterministic Value-Policy Gradients,” AAAI, vol. 34, no. 04, pp. 3316–3323, Apr. 2020.