Li, B., Z. Wang, H. Liu, Q. Du, T. Xiao, C. Zhang, and J. Zhu. “Learning Light-Weight Translation Models from Deep Transformer”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 15, May 2021, pp. 13217-25, https://ojs.aaai.org/index.php/AAAI/article/view/17561.