Semantic-Guided Novel Category Discovery

Authors

  • Weishuai Wang Wangxuan Institute of Computer Technology, Peking University
  • Ting Lei Wangxuan Institute of Computer Technology, Peking University
  • Qingchao Chen National Institute of Health Data Science, Peking University
  • Yang Liu Wangxuan Institute of Computer Technology, Peking University

DOI:

https://doi.org/10.1609/aaai.v38i6.28371

Keywords:

CV: Object Detection & Categorization, CV: Representation Learning for Vision

Abstract

The Novel Category Discovery problem aims to cluster an unlabeled set with the help of a labeled set consisting of disjoint but related classes. However, existing models treat class names as discrete one-hot labels and ignore the semantic understanding of these classes. In this paper, we propose a new setting named Semantic-guided Novel Category Discovery (SNCD), which requires the model to not only cluster the unlabeled images but also semantically recognize these images based on a set of their class names. The first challenge we confront pertains to effectively leveraging the class names of unlabeled images, given the inherent gap between the visual and linguistic domains. To address this issue, we incorporate a semantic-aware recognition mechanism. This is achieved by constructing dynamic class-wise visual prototypes as well as a semantic similarity matrix that enables the projection of visual features into the semantic space. The second challenge originates from the granularity disparity between the classification and clustering tasks. To deal with this, we develop a semantic-aware clustering process to facilitate the exchange of knowledge between the two tasks. Through extensive experiments, we demonstrate the mutual benefits of the recognition and clustering tasks, which can be jointly optimized. Experimental results on multiple datasets confirm the effectiveness of our proposed method. Our code is available at https://github.com/wang-weishuai/Semantic-guided-NCD.

Published

2024-03-24

How to Cite

Wang, W., Lei, T., Chen, Q., & Liu, Y. (2024). Semantic-Guided Novel Category Discovery. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5607–5614. https://doi.org/10.1609/aaai.v38i6.28371

Issue

Section

AAAI Technical Track on Computer Vision V