(1)
Yang, X.; Zhang, J.; Zhao, D.; Chen, G.; Tang, Z. Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys. AAAI 2026, 40, 27675-27683.