Foerster, J., G. Farquhar, T. Afouras, N. Nardelli, and S. Whiteson. “Counterfactual Multi-Agent Policy Gradients”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.11794.