Zhang, Zhaoyang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, and Ping Luo. 2024. “Cached Transformers: Improving Transformers With Differentiable Memory Cachde”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (15):16935-43. https://doi.org/10.1609/aaai.v38i15.29636.