1.
Liao H, Xu Y, He S, Li G, Yin X, Li D, et al. SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 16];40(38):31961-9. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/40466