Gu, H. (2026) “FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), pp. 21369–21377. doi: 10.1609/aaai.v40i26.39283.