[1] Dordevic, D. et al. 2024. Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence. 38, 21 (Mar. 2024), 23477–23479. DOI:https://doi.org/10.1609/aaai.v38i21.30436.