Zeng, Z., Hong, Y., Dai, H., Zhuang, H., & Chen, C. (2024). ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19506–19514. https://doi.org/10.1609/aaai.v38i17.29922