Learning Intrinsic Hierarchy for Generalized Category Discovery

Authors

  • Yu Duan Xidian University
  • Junzhi He Northwest Polytechnical University
  • Zhanxuan Hu Northwest Polytechnical University
  • Mengda Ji Yunnan Normal University
  • Rong Wang Northwest Polytechnical University
  • Quanxue Gao Xidian University

DOI:

https://doi.org/10.1609/aaai.v40i25.39236

Abstract

Generalized Category Discovery (GCD) aims to classify unlabeled data by leveraging knowledge from labeled categories. While existing methods have achieved remarkable progress, they often treat images as flat feature sets, neglecting the intrinsic hierarchy: where key objects dominate meaning and backgrounds serve as context. For instance, in images of a dog either standing on grass or lying on a bed, the dog remains the central semantic element, whereas the background varies. Motivated by this, we propose LEArning Intrinsic Hierarchy (LEAH), a lightweight plug-and-play module designed to model hierarchical structure within images. LEAH consists of two components: a pruner that filters task-irrelevant tokens to extract key objects, and a constructor that embeds key objects and full images into hyperbolic space using adaptive entailment cones to capture compositional semantics. LEAH can be easily integrated into existing GCD frameworks with minimal modification. When applied to SimGCD, it achieves up to 13.2% accuracy improvement on fine-grained benchmarks, demonstrating its effectiveness in discovering subtle inter-class differences through hierarchical modeling.

Downloads

Published

2026-03-14

How to Cite

Duan, Y., He, J., Hu, Z., Ji, M., Wang, R., & Gao, Q. (2026). Learning Intrinsic Hierarchy for Generalized Category Discovery. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 20950–20958. https://doi.org/10.1609/aaai.v40i25.39236

Issue

Section

AAAI Technical Track on Machine Learning II