Qin, Zongyue, Zifan He, Neha Prakriya, Jason Cong, and Yizhou Sun. 2025. “Dynamic-Width Speculative Beam Decoding for LLM Inference.” Proceedings of the AAAI Conference on Artificial Intelligence 39 (23): 25056–64. https://doi.org/10.1609/aaai.v39i23.34690.