[1]
Z. Yu, S. Li, and X. Zhang, “Language Model Distillation: A Temporal Difference Imitation Learning Perspective”, AAAI, vol. 40, no. 41, pp. 34512–34520, Mar. 2026.