(1) Zhang, G.; Kashima, H. Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning. AAAI 2023, 37, 11201-11209.