Cooperative Graph Transformer with Structural Consensus for Multi-View Learning

Zhiyuan Lai; Jiacheng Li; Jiayuan Wang; Shiping Wang

doi:10.1609/aaai.v40i27.39437

Authors

Zhiyuan Lai Fuzhou University
Jiacheng Li Fuzhou University
Jiayuan Wang Fuzhou University
Shiping Wang Fuzhou University

DOI:

https://doi.org/10.1609/aaai.v40i27.39437

Abstract

Multi-view learning aims to effectively integrate data from different sources by exploring the consistency and complementarity across views. Current multi-view methods based on Graph Convolutional Networks (GCNs) primarily focus on local information, making it difficult to capture global dependencies. Furthermore, multi-view data typically lack explicit structural representations, and the topologies constructed via node similarity in existing approaches are prone to noise, while simple fusion strategies are often inadequate for effectively suppressing this noise and for uncovering meaningful structural information. To tackle these issues, this paper proposes CoGFormer, a cooperative graph transformer with structural consensus learning. CoGFormer maps multi-view data into a unified space and jointly models local and global consensus: a denoising structural consensus graph convolutional network refines the consensus graph to enhance local consistency and robustness, while a structure-guided attention mechanism explicitly injects high-order cross-view structural biases to capture global consistency and improve semantic coherence. Experiments on multiple benchmarks demonstrate that CoGFormer outperforms existing state-of-the-art methods, validating its effectiveness.

Cooperative Graph Transformer with Structural Consensus for Multi-View Learning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information