[1]
X. Yang, J. Zhang, D. Zhao, G. Chen, and Z. Tang, “Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys”, AAAI, vol. 40, no. 33, pp. 27675–27683, Mar. 2026.