Hallak, A., A. Tamar, R. Munos, and S. Mannor. “Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30, no. 1, Feb. 2016, doi:10.1609/aaai.v30i1.10227.