[1]

A. Jain and V. Unhelkar, “GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation”, AAAI, vol. 38, no. 11, pp. 12763–12772, Mar. 2024.