Cooperative Graph Transformer with Structural Consensus for Multi-View Learning

Authors

  • Zhiyuan Lai Fuzhou University
  • Jiacheng Li Fuzhou University
  • Jiayuan Wang Fuzhou University
  • Shiping Wang Fuzhou University

DOI:

https://doi.org/10.1609/aaai.v40i27.39437

Abstract

Multi-view learning aims to effectively integrate data from different sources by exploring the consistency and complementarity across views. Current multi-view methods based on Graph Convolutional Networks (GCNs) primarily focus on local information, making it difficult to capture global dependencies. Furthermore, multi-view data typically lack explicit structural representations, and the topologies constructed via node similarity in existing approaches are prone to noise, while simple fusion strategies are often inadequate for effectively suppressing this noise and for uncovering meaningful structural information. To tackle these issues, this paper proposes CoGFormer, a cooperative graph transformer with structural consensus learning. CoGFormer maps multi-view data into a unified space and jointly models local and global consensus: a denoising structural consensus graph convolutional network refines the consensus graph to enhance local consistency and robustness, while a structure-guided attention mechanism explicitly injects high-order cross-view structural biases to capture global consistency and improve semantic coherence. Experiments on multiple benchmarks demonstrate that CoGFormer outperforms existing state-of-the-art methods, validating its effectiveness.

Downloads

Published

2026-03-14

How to Cite

Lai, Z., Li, J., Wang, J., & Wang, S. (2026). Cooperative Graph Transformer with Structural Consensus for Multi-View Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22751–22759. https://doi.org/10.1609/aaai.v40i27.39437

Issue

Section

AAAI Technical Track on Machine Learning IV