Boosting Active Learning via Improving Test Performance

Authors

  • Tianyang Wang Austin Peay State University
  • Xingjian Li Baidu Research, University of Macau
  • Pengkun Yang Tsinghua University
  • Guosheng Hu Oosto
  • Xiangrui Zeng Carnegie Mellon University
  • Siyu Huang Harvard University
  • Cheng-Zhong Xu University of Macau
  • Min Xu Carnegie Mellon University

DOI:

https://doi.org/10.1609/aaai.v36i8.20834

Keywords:

Machine Learning (ML), Computer Vision (CV)

Abstract

Central to active learning (AL) is the question of which data should be selected for annotation. Existing works attempt to select highly uncertain or informative data for annotation. Nevertheless, it remains unclear how the selected data impacts the test performance of the task model used in AL. In this work, we explore such an impact by theoretically proving that selecting unlabeled data of higher gradient norm leads to a lower upper bound on the test loss, and hence better test performance. However, due to the lack of label information, directly computing the gradient norm for unlabeled data is infeasible. To address this challenge, we propose two schemes, namely expected-gradnorm and entropy-gradnorm. The former computes the gradient norm by constructing an expected empirical loss, while the latter constructs an unsupervised loss with entropy. Furthermore, we integrate the two schemes into a universal AL framework. We evaluate our method on classical image classification and semantic segmentation tasks. To demonstrate its competency in domain applications and its robustness to noise, we also validate our method on a cellular imaging analysis task, namely cryo-Electron Tomography subtomogram classification. Results demonstrate that our method achieves superior performance over the state of the art. We refer readers to https://arxiv.org/pdf/2112.05683.pdf for the full version of this paper, which includes the appendix and source code link.
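The abstract describes scoring unlabeled data by the gradient norm induced by a surrogate loss, using either the prediction entropy (entropy-gradnorm) or an expected empirical loss over the predicted label distribution (expected-gradnorm). The snippet below is a minimal PyTorch-style sketch of this idea, under our own assumptions about the model and data interfaces; it is not the authors' released implementation (see the source code link in the full paper for that), and `grad_norm_scores` and its arguments are hypothetical names.

```python
# Minimal sketch of gradient-norm-based acquisition scoring (assumed interfaces,
# not the authors' released code).
import torch
import torch.nn.functional as F

def grad_norm_scores(model, unlabeled_inputs, scheme="entropy"):
    """Score each unlabeled example by the parameter-gradient norm that a
    label-free surrogate loss induces; higher scores are queried for annotation."""
    model.eval()  # avoid batch-norm issues with single-example batches
    scores = []
    for x in unlabeled_inputs:            # each x: one input tensor, e.g. (C, H, W)
        model.zero_grad()
        logits = model(x.unsqueeze(0))
        log_probs = F.log_softmax(logits, dim=1)
        probs = log_probs.exp()
        if scheme == "entropy":
            # entropy-gradnorm: entropy of the prediction as an unsupervised loss
            loss = -(probs * log_probs).sum()
        else:
            # expected-gradnorm: cross-entropy averaged over the predicted label
            # distribution, treated here as fixed soft pseudo-labels (an assumption;
            # the paper's exact construction may differ)
            loss = -(probs.detach() * log_probs).sum()
        loss.backward()
        norm_sq = sum((p.grad ** 2).sum() for p in model.parameters()
                      if p.grad is not None)
        scores.append(norm_sq.sqrt().item())
    return scores  # select the top-scoring examples in each AL round
```

In an AL loop, one would rank the unlabeled pool by these scores each round, annotate the top-ranked examples, and retrain the task model on the enlarged labeled set.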

Published

2022-06-28

How to Cite

Wang, T., Li, X., Yang, P., Hu, G., Zeng, X., Huang, S., Xu, C.-Z., & Xu, M. (2022). Boosting Active Learning via Improving Test Performance. Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), 8566-8574. https://doi.org/10.1609/aaai.v36i8.20834

Section

AAAI Technical Track on Machine Learning III