Tan, X., Li, H., Wang, L., Huang, X., & Xu, Z. (2021). Empowering Adaptive Early-Exit Inference with Latency Awareness. Proceedings of the AAAI Conference on Artificial Intelligence, 35(11), 9825-9833. https://doi.org/10.1609/aaai.v35i11.17181