Learning to Reweight with Deep Interactions

Authors

  • Yang Fan University of Science and Technology of China
  • Yingce Xia Microsoft Research Asia
  • Lijun Wu Microsoft Research Asia
  • Shufang Xie Microsoft Research Asia
  • Weiqing Liu Microsoft Research Asia
  • Jiang Bian Microsoft Research Asia
  • Tao Qin Microsoft Research Asia
  • Xiang-Yang Li University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v35i8.16906

Keywords:

(Deep) Neural Network Algorithms

Abstract

Recently the concept of teaching has been introduced into machine learning, in which a teacher model is used to guide the training of a student model (which will be used in real tasks) through data selection, loss function design, etc. Learning to reweight, which is a specific kind of teaching that reweights training data using a teacher model, receives much attention due to its simplicity and effectiveness. In existing learning to reweight works, the teacher model only utilizes shallow/surface information such as training iteration number and loss/accuracy of the student model from training/validation sets, but ignores the internal states of the student model, which limits the potential of learning to reweight. In this work, we propose an improved data reweighting algorithm, in which the student model provides its internal states to the teacher model, and the teacher model returns adaptive weights of training samples to enhance the training of the student model. The teacher model is jointly trained with the student model using meta gradients propagated from a validation set. Experiments on image classification with clean/noisy labels and neural machine translation empirically demonstrate that our algorithm makes significant improvement over previous methods.

Downloads

Published

2021-05-18

How to Cite

Fan, Y., Xia, Y., Wu, L., Xie, S., Liu, W., Bian, J., Qin, T., & Li, X.-Y. (2021). Learning to Reweight with Deep Interactions. Proceedings of the AAAI Conference on Artificial Intelligence, 35(8), 7385-7393. https://doi.org/10.1609/aaai.v35i8.16906

Issue

Section

AAAI Technical Track on Machine Learning I