Yang, X., Zhang, J., Zhao, D., Chen, G., & Tang, Z. (2026). Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 27675–27683. https://doi.org/10.1609/aaai.v40i33.39988