Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling

Yuanyang Zhang; Xinhang Wan; Chao Zhang; Jie Xu; Cunjian Chen; Tien-Tsin Wong; Li Yao; Yijie Lin

doi:10.1609/aaai.v40i34.40082

Authors

Yuanyang Zhang, School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
Xinhang Wan, College of Systems Engineering, National University of Defense Technology, Changsha, China
Chao Zhang, School of Robotics and Automation, Nanjing University, Suzhou, China
Jie Xu, Information Systems Technology and Design Pillar, Singapore University of Technology and Design, Singapore
Cunjian Chen, Department of Data Science & AI, Faculty of Information Technology, Monash University, Melbourne, Australia
Tien-Tsin Wong, Department of Data Science & AI, Faculty of Information Technology, Monash University, Melbourne, Australia
Li Yao, School of Computer Science and Engineering, Southeast University, Nanjing 210096, China; Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Yijie Lin, The Hong Kong University of Science and Technology, Hong Kong SAR, China

Abstract

Multi-view clustering (MVC) has recently garnered increasing attention for its ability to partition unlabeled samples into distinct clusters by leveraging complementary and consistent information from different views. Existing MVC methods primarily combine deep neural networks with contrastive learning for cross-view representation learning, yet often overlook the inherent global-local structural relationships among samples. While GNN-based methods capture local structures, they struggle to model global dependencies, leading to inferior inter-cluster separability. In contrast, Transformer-based methods excel at global aggregation but suffer from quadratic complexity, and their attention smoothing effect weakens fine-grained local structures, resulting in suboptimal intra-cluster compactness. To address these limitations, we propose a novel end-to-end MVC framework called Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling (MGLC). By flexibly constructing multi-view sequences, MGLC fully exploits the efficient sequence modeling capabilities of Mamba to jointly model cross-view dependencies and global-local structural relationships among samples. Furthermore, MGLC introduces a Cross-Mamba Fusion module to dynamically integrate cross-view and global-local structural representations. Additionally, MGLC incorporates a Dual Calibration Contrastive Learning module, guided by high-confidence pseudo-labels, that adaptively refines both feature and semantic representations while mitigating false negatives among semantically similar samples. Extensive comparative experiments and ablation studies demonstrate the effectiveness of MGLC.
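The abstract mentions using high-confidence pseudo-labels to mitigate false negatives in contrastive learning. The following is a minimal sketch of that general idea only, not the paper's Dual Calibration Contrastive Learning module: a two-view InfoNCE-style loss in which a pair of samples is dropped from the negatives when both are confidently assigned the same pseudo-label. The function name `masked_contrastive_loss`, the parameters `conf_threshold` and `temperature`, and the masking rule itself are illustrative assumptions that do not come from the source.

```python
import math


def _normalize(vec):
    # Scale a vector to unit L2 norm (assumes the vector is non-zero).
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def _dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def masked_contrastive_loss(view_a, view_b, pseudo_labels, confidences,
                            conf_threshold=0.9, temperature=0.5):
    """Hypothetical cross-view contrastive loss with false-negative masking.

    view_a, view_b : lists of embeddings; row i in both views is the same sample.
    pseudo_labels  : predicted cluster index per sample.
    confidences    : confidence of each pseudo-label, in [0, 1].
    """
    a = [_normalize(v) for v in view_a]
    b = [_normalize(v) for v in view_b]
    n = len(a)
    total = 0.0
    for i in range(n):
        # Positive pair: the same sample seen in the other view.
        pos = math.exp(_dot(a[i], b[i]) / temperature)
        denom = pos
        for j in range(n):
            if j == i:
                continue
            # Assumed rule: if both samples are confidently placed in the same
            # cluster, treating them as negatives would push apart samples
            # that are likely semantically similar, so skip the pair.
            likely_false_negative = (
                pseudo_labels[i] == pseudo_labels[j]
                and confidences[i] >= conf_threshold
                and confidences[j] >= conf_threshold
            )
            if likely_false_negative:
                continue
            denom += math.exp(_dot(a[i], b[j]) / temperature)
        total += -math.log(pos / denom)
    return total / n
```

Setting `conf_threshold` above 1 disables the masking and recovers a plain one-directional InfoNCE loss, which gives a simple baseline for comparison. Because masking only removes terms from the denominator, the masked loss is never larger than the unmasked one on the same inputs.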
