Xu, Canwen, and Julian McAuley. “A Survey on Model Compression and Acceleration for Pretrained Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 9, June 2023, pp. 10566-75, doi:10.1609/aaai.v37i9.26255.