[1]
Z. Zhou, W. Shen, H. Chen, L. Tang, Y. Chen, and Q. Zhang, “Batch Normalization Is Blind to the First and Second Derivatives of the Loss,” AAAI, vol. 38, no. 18, pp. 20010–20018, Mar. 2024.