Gu, H., Hu, L., Niu, S., & Liu, F. (2026). FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 21369–21377. https://doi.org/10.1609/aaai.v40i26.39283