Ding, Z. (2024) “How to Trade Off the Quantity and Capacity of Teacher Ensemble: Learning Categorical Distribution to Stochastically Employ a Teacher for Distillation”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), pp. 17915–17923. doi: 10.1609/aaai.v38i16.29746.