(1)
Gao, Z.; Xia, Q.; Xu, T.; Duan, X.; Zheng, Z.; Wang, Z.; Chen, E. Multi-Branch Self-Drafting for LLM Inference Acceleration. AAAI 2025, 39, 23942-23950.