Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling

Authors

  • Yuanyang Zhang School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
  • Xinhang Wan College of Systems Engineering, National University of Defense Technology, Changsha, China
  • Chao Zhang School of Robotics and Automation, Nanjing University, Suzhou, China
  • Jie Xu Information Systems Technology and Design Pillar, Singapore University of Technology and Design, Singapore
  • Cunjian Chen Department of Data Science & AI, Faculty of Information Technology, Monash University, Melbourne, Australia
  • Tien-Tsin Wong Department of Data Science & AI, Faculty of Information Technology, Monash University, Melbourne, Australia
  • Li Yao School of Computer Science and Engineering, Southeast University, Nanjing 210096, China Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
  • Yijie Lin The Hong Kong University of Science and Technology, Hong Kong SAR, China

DOI:

https://doi.org/10.1609/aaai.v40i34.40082

Abstract

Multi-view clustering (MVC) has recently garnered increasing attention for its ability to partition unlabeled samples into distinct clusters by leveraging complementary and consistent information from different views. Existing MVC methods primarily combine deep neural networks with contrastive learning for cross-view representation learning, yet often overlook the inherent global-local structural relationships among samples. While GNN-based methods capture local structures, they struggle to model global dependencies, leading to inferior inter-cluster separability. In contrast, Transformer-based methods excel at global aggregation but suffer from quadratic complexity, and their attention smoothing effect weakens fine-grained local structures, resulting in suboptimal intra-cluster compactness. To address these limitations, we propose a novel end-to-end MVC framework called Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling (MGLC). By flexibly constructing multi-view sequences, MGLC fully exploits the efficient sequence modeling capabilities of Mamba to jointly model cross-view dependencies and global-local structural relationships among samples. Furthermore, MGLC introduces a Cross-Mamba Fusion module to dynamically integrate cross-view and global-local structural representations. Additionally, MGLC incorporates a Dual Calibration Contrastive Learning module, guided by high-confidence pseudo-labels, that adaptively refines both feature and semantic representations while mitigating false negatives among semantically similar samples. Extensive comparative experiments and ablation studies demonstrate the effectiveness of MGLC.

Published

2026-03-14

How to Cite

Zhang, Y., Wan, X., Zhang, C., Xu, J., Chen, C., Wong, T.-T., … Lin, Y. (2026). Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 28519–28527. https://doi.org/10.1609/aaai.v40i34.40082

Issue

Section

AAAI Technical Track on Machine Learning XI