Dharmavaram, Akshay, Matthew Riemer, and Shalabh Bhatnagar. “Hierarchical Average Reward Policy Gradient Algorithms (Student Abstract)”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 10 (April 3, 2020): 13777–13778. Accessed May 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/7160.