Yang, Xu, et al. “Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 33, Mar. 2026, pp. 27675-83, doi:10.1609/aaai.v40i33.39988.