Cai, Q., L. Pan, and P. Tang. ‚ÄúDeterministic Value-Policy Gradients‚ÄĚ. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, Apr. 2020, pp. 3316-23, doi:10.1609/aaai.v34i04.5732.