Liu, Y., Ge, J., Li, C., & Gui, J. (2021). Delving into Variance Transmission and Normalization: Shift of Average Gradient Makes the Network Collapse. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2216-2224. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16320