Chen, C.-Y., J. Choi, D. Brand, A. Agrawal, W. Zhang, and K. Gopalakrishnan. “AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.11728.