Wang, M., Li, X., Wang, M., & Bennis, H. (2026). Offline Meta-Reinforcement Learning with Flow-Based Task Inference and Adaptive Correction of Feature Overgeneralization. Proceedings of the AAAI Conference on Artificial Intelligence, 40(31), 26390–26397. https://doi.org/10.1609/aaai.v40i31.39845