Wang, Pichao, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, and Rong Jin. “Scaled ReLU Matters for Training Vision Transformers”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 3 (June 28, 2022): 2495-2503. Accessed April 18, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20150.