[1]
M. Wang, X. Li, M. Wang, and H. Bennis, “Offline Meta-Reinforcement Learning with Flow-Based Task Inference and Adaptive Correction of Feature Overgeneralization”, AAAI, vol. 40, no. 31, pp. 26390–26397, Mar. 2026.