[1] Henderson, P., Chang, W.-D., Bacon, P.-L., Meger, D., Pineau, J. and Precup, D. 2018. OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 32, 1 (Apr. 2018). DOI:https://doi.org/10.1609/aaai.v32i1.11775.