Passban, P., Wu, Y., Rezagholizadeh, M., & Liu, Q. (2021). ALP-KD: Attention-Based Layer Projection for Knowledge Distillation. Proceedings of the AAAI Conference on Artificial Intelligence, 35(15), 13657-13665. https://doi.org/10.1609/aaai.v35i15.17610