[1]
H. Liao, “SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning”, AAAI, vol. 40, no. 38, pp. 31961–31969, Mar. 2026.