Melo, F. (2011). Differential Eligibility Vectors for Advantage Updating and Gradient Methods. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 441-446. https://doi.org/10.1609/aaai.v25i1.7938