Cai, Q., Pan, L. and Tang, P. (2020) “Deterministic Value-Policy Gradients”, Proceedings of the AAAI Conference on Artificial Intelligence, 34(04), pp. 3316-3323. doi: 10.1609/aaai.v34i04.5732.