Enhancing Semi-supervised Domain Adaptation via Effective Target Labeling

Jiujun He; Bin Liu; Guosheng Yin

doi:10.1609/aaai.v38i11.29130

Authors

Jiujun He Center of Statistical Research, School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
Bin Liu Center of Statistical Research, School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
Guosheng Yin Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong, China

DOI:

https://doi.org/10.1609/aaai.v38i11.29130

Keywords:

ML: Transfer, Domain Adaptation, Multi-Task Learning, ML: Active Learning, ML: Semi-Supervised Learning

Abstract

Existing semi-supervised domain adaptation (SSDA) models have exhibited impressive performance on the target domain by effectively utilizing few labeled target samples per class (e.g., 3 samples per class). To guarantee an equal number of labeled target samples for each class, however, they require domain experts to manually recognize a considerable amount of the unlabeled target data. Moreover, as the target samples are not equally informative for shaping the decision boundaries of the learning models, it is crucial to select the most informative target samples for labeling, which is, however, impossible for human selectors. As a remedy, we propose an EFfective Target Labeling (EFTL) framework that harnesses active learning and pseudo-labeling strategies to automatically select some informative target samples to annotate. Concretely, we introduce a novel sample query strategy, called non-maximal degree node suppression (NDNS), that iteratively performs maximal degree node query and non-maximal degree node removal to select representative and diverse target samples for labeling. To learn target-specific characteristics, we propose a novel pseudo-labeling strategy that attempts to label low-confidence target samples accurately via clustering consistency (CC), and then inject information of the model uncertainty into our query process. CC enhances the utilization of the annotation budget and increases the number of “labeled” target samples while requiring no additional manual effort. Our proposed EFTL framework can be easily coupled with existing SSDA models, showing significant improvements on three benchmarks

Enhancing Semi-supervised Domain Adaptation via Effective Target Labeling

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information