Dordevic, Danilo, Vukasin Bozic, Joseph Thommes, Daniele Coppola, and Sidak Pal Singh. 2024. “Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks As an Alternative to Attention Layers in Transformers (Student Abstract)”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (21):23477-79. https://doi.org/10.1609/aaai.v38i21.30436.