Xu, Canwen, and Julian McAuley. 2023. “A Survey on Model Compression and Acceleration for Pretrained Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence 37 (9): 10566–75. https://doi.org/10.1609/aaai.v37i9.26255.