1.
Zhang Z, Shao W, Ge Y, Wang X, Gu J, Luo P. Cached Transformers: Improving Transformers with Differentiable Memory Cache. AAAI [Internet]. 2024 Mar. 24 [cited 2026 May 26];38(15):16935-43. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/29636