ZENG, Yujie; HE, Wenlong; VASYLTSOV, Ihor; PANG, Jiali; CHEN, Lin. Acceleration of Large Transformer Model Training by Sensitivity-Based Layer Dropping. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 37, n. 9, p. 11156–11163, 2023. DOI: 10.1609/aaai.v37i9.26321. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/26321. Acesso em: 25 may. 2026.