Passban, P., Y. Wu, M. Rezagholizadeh, and Q. Liu. “ALP-KD: Attention-Based Layer Projection for Knowledge Distillation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 15, May 2021, pp. 13657-65, doi:10.1609/aaai.v35i15.17610.