Henderson, P., Chang, W.-D., Bacon, P.-L., Meger, D., Pineau, J. and Precup, D. (2018) “OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). doi: 10.1609/aaai.v32i1.11775.