Dharmavaram, Akshay, Matthew Riemer, and Shalabh Bhatnagar. “Hierarchical Average Reward Policy Gradient Algorithms (Student Abstract)”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 10 (April 3, 2020): 13777-13778. Accessed July 14, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/7160.