Gurioli, A., Pennino, F., Monteiro, J. and Gabbrielli, M. (2026) “MoSE: Hierarchical Self-Distillation Enhances Early Layer Embeddings”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(37), pp. 30897-30906. doi: 10.1609/aaai.v40i37.40348.