[1]
Z. Zeng, Y. Hong, H. Dai, H. Zhuang, and C. Chen, “ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference”, AAAI, vol. 38, no. 17, pp. 19506–19514, Mar. 2024.