Li, B., Wang, Z., Liu, H., Du, Q., Xiao, T., Zhang, C., & Zhu, J. (2021). Learning Light-Weight Translation Models from Deep Transformer. Proceedings of the AAAI Conference on Artificial Intelligence, 35(15), 13217-13225. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17561