Inconsistency-Based Data-Centric Active Open-Set Annotation

Authors

  • Ruiyu Mao University of Texas at Dallas
  • Ouyang Xu University of Texas at Dallas
  • Yunhui Guo University of Texas at Dallas

DOI:

https://doi.org/10.1609/aaai.v38i5.28213

Keywords:

CV: Applications, CV: Object Detection & Categorization

Abstract

Active learning, a method to reduce labeling effort for training deep neural networks, is often limited by the assumption that all unlabeled data belong to known classes. This closed-world assumption fails in practical scenarios with unknown classes in the data, leading to active open-set annotation challenges. Existing methods struggle with this uncertainty. We introduce NEAT, a novel, computationally efficient, data-centric active learning approach for open-set data. NEAT differentiates and labels known classes from a mix of known and unknown classes, using a clusterability criterion and a consistency mea- sure that detects inconsistencies between model predictions and feature distribution. In contrast to recent learning-centric solutions, NEAT shows superior performance in active open- set annotation, as our experiments confirm. Additional details on the further evaluation metrics, implementation, and archi- tecture of our method can be found in the public document at https://arxiv.org/pdf/2401.04923.pdf.

Published

2024-03-24

How to Cite

Mao, R., Xu, O., & Guo, Y. (2024). Inconsistency-Based Data-Centric Active Open-Set Annotation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(5), 4180-4188. https://doi.org/10.1609/aaai.v38i5.28213

Issue

Section

AAAI Technical Track on Computer Vision IV