[1]
T. Guo, S. Zhao, S. Zhu, and C. Ma, “SPEED-Q: Staged Processing with Enhanced Distillation Towards Efficient Low-Bit On-Device VLM Quantization”, AAAI, vol. 40, no. 26, pp. 21486–21494, Mar. 2026.