Generalized Category Discovery with Decoupled Prototypical Network

Authors

  • Wenbin An School of Automation Science and Engineering, Xi'an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Feng Tian School of Computer Science and Technology, Xi'an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Qinghua Zheng School of Computer Science and Technology, Xi'an Jiaotong University National Engineering Laboratory for Big Data Analytics
  • Wei Ding Department of Computer Science, University of Massachusetts Boston
  • Qianying Wang Lenovo Research
  • Ping Chen Department of Engineering, University of Massachusetts Boston

DOI:

https://doi.org/10.1609/aaai.v37i11.26475

Keywords:

SNLP: Text Mining, SNLP: Text Classification

Abstract

Generalized Category Discovery (GCD) aims to recognize both known and novel categories from a set of unlabeled data, based on another dataset labeled with only known categories. Without considering differences between known and novel categories, current methods learn about them in a coupled manner, which can hurt model's generalization and discriminative ability. Furthermore, the coupled training approach prevents these models transferring category-specific knowledge explicitly from labeled data to unlabeled data, which can lose high-level semantic information and impair model performance. To mitigate above limitations, we present a novel model called Decoupled Prototypical Network (DPN). By formulating a bipartite matching problem for category prototypes, DPN can not only decouple known and novel categories to achieve different training targets effectively, but also align known categories in labeled and unlabeled data to transfer category-specific knowledge explicitly and capture high-level semantics. Furthermore, DPN can learn more discriminative features for both known and novel categories through our proposed Semantic-aware Prototypical Learning (SPL). Besides capturing meaningful semantic information, SPL can also alleviate the noise of hard pseudo labels through semantic-weighted soft assignment. Extensive experiments show that DPN outperforms state-of-the-art models by a large margin on all evaluation metrics across multiple benchmark datasets. Code and data are available at https://github.com/Lackel/DPN.

Downloads

Published

2023-06-26

How to Cite

An, W., Tian, F., Zheng, Q., Ding, W., Wang, Q., & Chen, P. (2023). Generalized Category Discovery with Decoupled Prototypical Network. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 12527-12535. https://doi.org/10.1609/aaai.v37i11.26475

Issue

Section

AAAI Technical Track on Speech & Natural Language Processing