Gao, Zipeng, Qingrong Xia, Tong Xu, Xinyu Duan, Zhi Zheng, Zhefeng Wang, and Enhong Chen. “Multi-Branch Self-Drafting for LLM Inference Acceleration.” Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 22 (April 11, 2025): 23942–23950. Accessed April 23, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/34567.