Cai, Qingpeng, Ling Pan, and Pingzhong Tang. 2020. “Deterministic Value-Policy Gradients”. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04):3316-23. https://doi.org/10.1609/aaai.v34i04.5732.