Zhang, Z. (2024) “Cached Transformers: Improving Transformers with Differentiable Memory Cache”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), pp. 16935–16943. doi: 10.1609/aaai.v38i15.29636.