(1)
Gu, H.; Hu, L.; Niu, S.; Liu, F. FLRQ: Faster LLM Quantization With Flexible Low-Rank Matrix Sketching. AAAI 2026, 40, 21369-21377.