[1]
W. Kim, D. Ki, and B.-J. Lee, “Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization”, AAAI, vol. 38, no. 12, pp. 13185-13192, Mar. 2024.