1.
Henderson P, Chang W-D, Bacon P-L, Meger D, Pineau J, Precup D. OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning. AAAI [Internet]. 2018Apr.29 [cited 2024Jul.24];32(1). Available from: https://ojs.aaai.org/index.php/AAAI/article/view/11775