A Non-parametric Graph Clustering Framework for Multi-View Data

Authors

  • Shengju Yu School of Computer, National University of Defense Technology
  • Siwei Wang Intelligent Game and Decision Lab
  • Zhibin Dong School of Computer, National University of Defense Technology
  • Wenxuan Tu School of Computer, National University of Defense Technology
  • Suyuan Liu School of Computer, National University of Defense Technology
  • Zhao Lv Intelligent Game and Decision Lab
  • Pan Li Intelligent Game and Decision Lab
  • Miao Wang Intelligent Game and Decision Lab
  • En Zhu School of Computer, National University of Defense Technology

DOI:

https://doi.org/10.1609/aaai.v38i15.29594

Keywords:

ML: Clustering, ML: Multi-instance/Multi-view Learning

Abstract

Multi-view graph clustering (MVGC) derives encouraging grouping results by seamlessly integrating abundant information inside heterogeneous data, and has captured surging focus recently. Nevertheless, the majority of current MVGC works involve at least one hyper-parameter, which not only requires additional efforts for tuning, but also leads to a complicated solving procedure, largely harming the flexibility and scalability of corresponding algorithms. To this end, in the article we are devoted to getting rid of hyper-parameters, and devise a non-parametric graph clustering (NpGC) framework to more practically partition multi-view data. To be specific, we hold that hyper-parameters play a role in balancing error item and regularization item so as to form high-quality clustering representations. Therefore, under without the assistance of hyper-parameters, how to acquire high-quality representations becomes the key. Inspired by this, we adopt two types of anchors, view-related and view-unrelated, to concurrently mine exclusive characteristics and common characteristics among views. Then, all anchors' information is gathered together via a consensus bipartite graph. By such ways, NpGC extracts both complementary and consistent multi-view features, thereby obtaining superior clustering results. Also, linear complexities enable it to handle datasets with over 120000 samples. Numerous experiments reveal NpGC's strong points compared to lots of classical approaches.

Downloads

Published

2024-03-24

How to Cite

Yu, S., Wang, S., Dong, Z., Tu, W., Liu, S., Lv, Z., Li, P., Wang, M., & Zhu, E. (2024). A Non-parametric Graph Clustering Framework for Multi-View Data. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16558-16567. https://doi.org/10.1609/aaai.v38i15.29594

Issue

Section

AAAI Technical Track on Machine Learning VI