Wang, J., S. Wang, R.-R. Chen, and M. Ji. “Demystifying Why Local Aggregation Helps: Convergence Analysis of Hierarchical SGD”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 8, June 2022, pp. 8548-56, doi:10.1609/aaai.v36i8.20832.