Pan L. Towards Robust, Efficient, and Practical Decision-Making: From Reward-Maximizing Deep Reinforcement Learning to Reward-Matching GFlowNets. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 13];39(27):28724-. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/35118