Yao, Hengshuai, and Csaba Szepesvari. “Approximate Policy Iteration With Linear Action Models”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 26, no. 1, Sept. 2021, pp. 1212-8, doi:10.1609/aaai.v26i1.8319.