Cai, Q., L. Pan, and P. Tang. “Deterministic Value-Policy Gradients”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, Apr. 2020, pp. 3316-23, doi:10.1609/aaai.v34i04.5732.