[1]
D. Mankowitz, T. Mann, P.-L. Bacon, D. Precup, and S. Mannor, “Learning Robust Options”, AAAI, vol. 32, no. 1, Apr. 2018.