Convex Batch Mode Active Sampling via α-Relative Pearson Divergence

Hanmo Wang; Liang Du; Peng Zhou; Lei Shi; Yi-Dong Shen

doi:10.1609/aaai.v29i1.9618

Authors

Hanmo Wang Chinese Academy of Sciences
Liang Du Chinese Academy of Sciences
Peng Zhou Chinese Academy of Sciences
Lei Shi Chinese Academy of Sciences
Yi-Dong Shen Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v29i1.9618

Keywords:

Active Learning, Batch Mode Active Learning, alpha-relative Pearson divergence, RPE, distribution comparison, maximum mean discrepancy, MMD, convex, minimax, min-max

Abstract

Active learning is a machine learning technique that trains a classifier after selecting a subset from an unlabeled dataset for labeling and using the selected data for training. Recently, batch mode active learning, which selects a batch of samples to label in parallel, has attracted a lot of attention. Its challenge lies in the choice of criteria used for guiding the search of the optimal batch. In this paper, we propose a novel approach to selecting the optimal batch of queries by minimizing the α-relative Pearson divergence (RPE) between the labeled and the original datasets. This particular divergence is chosen since it can distinguish the optimal batch more easily than other measures especially when available candidates are similar. The proposed objective is a min-max optimization problem, and it is difficult to solve due to the involvement of both minimization and maximization. We find that the objective has an equivalent convex form, and thus a global optimal solution can be obtained. Then the subgradient method can be applied to solve the simplified convex problem. Our empirical studies on UCI datasets demonstrate the effectiveness of the proposed approach compared with the state-of-the-art batch mode active learning methods.

Convex Batch Mode Active Sampling via α-Relative Pearson Divergence

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information