DC-SPAN: A Dual Contrastive Attention Network for Multi-View Clustering
DOI:
https://doi.org/10.1609/aaai.v40i24.39099
Abstract
Multi-view clustering aims to group data by integrating complementary information from multiple views. However, the inherent heterogeneity among views often leads to feature entanglement, severely limiting clustering performance. To address this challenge, we propose DC-SPAN, a Dual Contrastive Attention Network grounded in a disentangle-then-fuse paradigm. DC-SPAN employs a dual-path variational architecture to explicitly decompose each view into shared and private latent subspaces. These representations are then robustly integrated via a Product-of-Experts (PoE) mechanism. At the heart of our model is a novel dual contrastive learning objective that simultaneously encourages alignment of shared components across views and enforces separation of private ones, enabling structured and disentangled representations. A gated attention fusion module further adaptively aggregates these latent factors to yield a unified, discriminative embedding. The overall model is trained end-to-end with a composite loss function that combines reconstruction, orthogonality, and contrastive terms, together with a two-stage training scheme for improved stability. Extensive experiments on benchmark datasets demonstrate that DC-SPAN consistently outperforms existing state-of-the-art methods, highlighting its effectiveness and robustness in handling multi-view heterogeneity.
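The abstract does not give the PoE formulation used by DC-SPAN, but the standard Product-of-Experts combination of per-view Gaussian posteriors (precisions add; the joint mean is the precision-weighted average of the expert means) can be sketched as follows. The function name and the NumPy-based formulation are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def poe_gaussian(mus, logvars):
    """Hypothetical sketch of a standard Product-of-Experts fusion:
    combine per-view Gaussians N(mu_i, var_i) into one joint Gaussian.
    Precisions (1/var) add; the joint mean is the precision-weighted
    average of the expert means."""
    precisions = [np.exp(-lv) for lv in logvars]  # 1 / var_i per view
    joint_precision = np.sum(precisions, axis=0)
    joint_var = 1.0 / joint_precision
    joint_mu = joint_var * np.sum(
        [p * mu for p, mu in zip(precisions, mus)], axis=0
    )
    return joint_mu, np.log(joint_var)

# Two equally confident views pull the joint mean to their average,
# and the joint variance shrinks below either expert's variance.
mu, logvar = poe_gaussian(
    [np.zeros(3), 2.0 * np.ones(3)],   # expert means
    [np.zeros(3), np.zeros(3)],        # log-variances (var = 1 each)
)
```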
Published
2026-03-14
How to Cite
Chen, J., Dong, Z., Li, T., & Han, Y. (2026). DC-SPAN: A Dual Contrastive Attention Network for Multi-View Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(24), 20127-20135. https://doi.org/10.1609/aaai.v40i24.39099
Issue
Section
AAAI Technical Track on Machine Learning I