[1]
Chen, C.-Y., Choi, J., Brand, D., Agrawal, A., Zhang, W. and Gopalakrishnan, K. 2018. AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training. Proceedings of the AAAI Conference on Artificial Intelligence. 32, 1 (Apr. 2018). DOI:https://doi.org/10.1609/aaai.v32i1.11728.