Gao, Z., Xia, Q., Xu, T., Duan, X., Zheng, Z., Wang, Z. and Chen, E. (2025) “Multi-Branch Self-Drafting for LLM Inference Acceleration”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), pp. 23942-23950. doi: 10.1609/aaai.v39i22.34567.