(1)
Ding, Z.; Jiang, G.; Zhang, S.; Guo, L.; Lin, W. How to Trade Off the Quantity and Capacity of Teacher Ensemble: Learning Categorical Distribution to Stochastically Employ a Teacher for Distillation. AAAI 2024, 38, 17915-17923.