DORDEVIC, Danilo; BOZIC, Vukasin; THOMMES, Joseph; COPPOLA, Daniele; PAL SINGH, Sidak. Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 38, n. 21, p. 23477–23479, 2024. DOI: 10.1609/aaai.v38i21.30436. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/30436. Acesso em: 29 may. 2026.