Henderson, P., W.-D. Chang, P.-L. Bacon, D. Meger, J. Pineau, and D. Precup. “OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.11775.