Bulatov, A., Kuratov, Y., Kapushev, Y., & Burtsev, M. (2024). Beyond Attention: Breaking the Limits of Transformer Context Length with Recurrent Memory. Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), 17700–17708. https://doi.org/10.1609/aaai.v38i16.29722