Dharmavaram, A., M. Riemer, and S. Bhatnagar. “Hierarchical Average Reward Policy Gradient Algorithms (Student Abstract)”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 10, Apr. 2020, pp. 13777-13778, doi:10.1609/aaai.v34i10.7160.