Dordevic, D. (2024) “Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract)”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), pp. 23477–23479. doi: 10.1609/aaai.v38i21.30436.