Meta-Normalizing Flow for Data-Limited Offline Meta-Reinforcement Learning (Student Abstract)

Lianghui Liu; Zongzhang Zhang

doi:10.1609/aaai.v40i48.42241

Meta-Normalizing Flow for Data-Limited Offline Meta-Reinforcement Learning (Student Abstract)

Authors

Lianghui Liu National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
Zongzhang Zhang National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China

DOI:

https://doi.org/10.1609/aaai.v40i48.42241

Abstract

Offline Meta-Reinforcement Learning (OMRL) leverages pre-collected data to adapt to new tasks. Context-based methods learn task representations from contexts. However, the context is influenced by both the task and the behavior policy. The mismatch between the behavior policy and the testing policy causes a context distribution shift problem, which results in poor task representations and degraded performance. This problem is exacerbated in settings with data limitations. To address this, we propose a novel approach called Meta-Normalizing Flow (Meta-NF). First, it employs a highly expressive and sample-efficient normalizing flow policy. Second, it incorporates a metric for testing-time task representation selection to effectively mitigate the context shift problem. Empirical results demonstrate that Meta-NF outperforms existing OMRL methods, with both components contributing to its strong performance.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

Published

2026-03-14

How to Cite

Liu, L., & Zhang, Z. (2026). Meta-Normalizing Flow for Data-Limited Offline Meta-Reinforcement Learning (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41278–41280. https://doi.org/10.1609/aaai.v40i48.42241

Download Citation

Issue

Vol. 40 No. 48: EAAI-26 AI for Education, Model AI Assignments, AAAI-26 Emerging Trends, Doctoral Consortium, Student Abstracts, Undergraduate Consortium and Demonstrations

Section

AAAI Student Abstract and Poster Program

Meta-Normalizing Flow for Data-Limited Offline Meta-Reinforcement Learning (Student Abstract)

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information