GeCC: Generalized Contrastive Clustering with Domain Shifts Modeling

Authors

  • Yujie Chen Shenzhen University
  • Wenhui Wu Shenzhen University
  • Le Ou-Yang Shenzhen University
  • Ran Wang Shenzhen University
  • Debby D. Wang Hong Kong Metropolitan University

DOI:

https://doi.org/10.1609/aaai.v39i15.33753

Abstract

Contrastive clustering performs clustering and data representation in a unified model, where instance- and cluster-level constrastive learning are conducted simultaneously. However, commonly-used data augmentation methods make contrastive mechanism effect but may cause representation learning getting stuck in domain-specific information, which further deteriorates clustering performance and limits generalization ability. To this end, we propose a new framework, named Generalized Contrastive Clustering with domain shifts modeling (GeCC), which can integrate diverse domain knowledge to improve the clustering performance. Specifically, we first design a cluster-guided domain shifts modeling module to synthesize a reference view with diverse domain information. Then, we introduce instance representation and cluster assignment contrastive modules with well-designed attention weights to guide the representation learning and clustering. In this way, our method can maximize the extraction of cluster-related information and avoid over-fitting domain-specific features. Experimental results on four benchmark datasets demonstrate that our proposed method consistently outperforms other state-of-the-art methods.

Downloads

Published

2025-04-11

How to Cite

Chen, Y., Wu, W., Ou-Yang, L., Wang, R., & Wang, D. D. (2025). GeCC: Generalized Contrastive Clustering with Domain Shifts Modeling. Proceedings of the AAAI Conference on Artificial Intelligence, 39(15), 15966-15974. https://doi.org/10.1609/aaai.v39i15.33753

Issue

Section

AAAI Technical Track on Machine Learning I