Dordevic, Danilo, et al. “Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract).” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 21, Mar. 2024, pp. 23477-79, doi:10.1609/aaai.v38i21.30436.