Liu, Y., Ge, J., Li, C., & Gui, J. (2021). Delving into Variance Transmission and Normalization: Shift of Average Gradient Makes the Network Collapse. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2216-2224. https://doi.org/10.1609/aaai.v35i3.16320