Harutyunyan, A., P. Vrancx, P.-L. Bacon, D. Precup, and A. Nowé. “Learning With Options That Terminate Off-Policy”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.11740.