[1] G. Zhang and H. Kashima, “Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning,” AAAI, vol. 37, no. 9, pp. 11201-11209, Jun. 2023.