Guo, T. (2026) “SPEED-Q: Staged Processing with Enhanced Distillation Towards Efficient Low-Bit On-Device VLM Quantization”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), pp. 21486–21494. doi: 10.1609/aaai.v40i26.39296.