Wang, P., Wang, X., Luo, H., Zhou, J., Zhou, Z., Wang, F., Li, H., & Jin, R. (2022). Scaled ReLU Matters for Training Vision Transformers. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 2495-2503. https://doi.org/10.1609/aaai.v36i3.20150