Xia, Y., & Sun, F. (2026). Behavior Regularization with Flow Latent Policy for Offline Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27028–27036. https://doi.org/10.1609/aaai.v40i32.39916