[1]
Harutyunyan, A., Vrancx, P., Bacon, P.-L., Precup, D. and Nowé, A. 2018. Learning With Options That Terminate Off-Policy. Proceedings of the AAAI Conference on Artificial Intelligence. 32, 1 (Apr. 2018). DOI:https://doi.org/10.1609/aaai.v32i1.11740.