Guo, Tianyu, Shanwei Zhao, Shiai Zhu, and Chenguang Ma. 2026. “SPEED-Q: Staged Processing With Enhanced Distillation Towards Efficient Low-Bit On-Device VLM Quantization.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (26): 21486–94. https://doi.org/10.1609/aaai.v40i26.39296.