(1) Huo, Z.; Gu, B.; Huang, H. Large Batch Optimization for Deep Learning Using New Complete Layer-Wise Adaptive Rate Scaling. AAAI 2021, 35, 7883-7890.