Zhou, Z., Shen, W., Chen, H., Tang, L., Chen, Y., & Zhang, Q. (2024). Batch Normalization Is Blind to the First and Second Derivatives of the Loss. Proceedings of the AAAI Conference on Artificial Intelligence, 38(18), 20010-20018. https://doi.org/10.1609/aaai.v38i18.29978