Zeng, Yujie, Wenlong He, Ihor Vasyltsov, Jiali Pang, and Lin Chen. 2023. “Acceleration of Large Transformer Model Training by Sensitivity-Based Layer Dropping”. Proceedings of the AAAI Conference on Artificial Intelligence 37 (9):11156-63. https://doi.org/10.1609/aaai.v37i9.26321.