Xu, C., & McAuley, J. (2023). A Survey on Model Compression and Acceleration for Pretrained Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 37(9), 10566–10575. https://doi.org/10.1609/aaai.v37i9.26255