WANG, Pichao; WANG, Xue; LUO, Hao; ZHOU, Jingkai; ZHOU, Zhipeng; WANG, Fan; LI, Hao; JIN, Rong. Scaled ReLU Matters for Training Vision Transformers. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 36, n. 3, p. 2495–2503, 2022. DOI: 10.1609/aaai.v36i3.20150. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/20150. Accessed: 23 May 2026.