(1) Ding, Z.; Jiang, G.; Zhang, S.; Guo, L.; Lin, W. SKDBERT: Compressing BERT via Stochastic Knowledge Distillation. AAAI 2023, 37, 7414–7422.