Zhang, G., and H. Kashima. “Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 9, June 2023, pp. 11201-11209, doi:10.1609/aaai.v37i9.26326.