[1]
Cai, Q., Pan, L. and Tang, P. 2020. Deterministic Value-Policy Gradients. Proceedings of the AAAI Conference on Artificial Intelligence. 34, 04 (Apr. 2020), 3316-3323. DOI:https://doi.org/10.1609/aaai.v34i04.5732.