[1] J. Jiang, X. Zhou, J. Li, G. Han, X. Shi, and F. Deng, “Hierarchical Reinforcement Learning with Topology-Aware Exploration Framework for Multi-path Commodity Flow Problem,” AAAI, vol. 40, no. 43, pp. 36280–36288, Mar. 2026.