Zhang, G. and Kashima, H. (2023) “Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(9), pp. 11201-11209. doi: 10.1609/aaai.v37i9.26326.