[1]
Q. Cai, L. Pan, and P. Tang, “Deterministic Value-Policy Gradients”, AAAI, vol. 34, no. 04, pp. 3316-3323, Apr. 2020.