(1)
Wu, X.; Xie, Y.; Du, S. S.; Ward, R. AdaLoss: A Computationally-Efficient and Provably Convergent Adaptive Gradient Method. AAAI 2022, 36, 8691-8699.