Rethinking Cancer Gene Identification Through Graph Anomaly Analysis

Authors

  • Yilong Zang School of Hotel and Tourism Management, The Hong Kong Polytechnic University
  • Lingfei Ren School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics
  • Yue Li School of Computer Science, Wuhan University
  • Zhikang Wang Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University
  • David Antony Selby German Research Center for Artificial Intelligence (DFKI) and RPTU Kaiserslautern
  • Zheng Wang School of Computer Science, Wuhan University
  • Sebastian Josef Vollmer German Research Center for Artificial Intelligence (DFKI) and RPTU Kaiserslautern
  • Hongzhi Yin University of Queensland
  • Jiangning Song Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University
  • Junhang Wu College of Information Science and Technology, Shihezi University

DOI:

https://doi.org/10.1609/aaai.v39i12.33436

Abstract

Graph neural networks (GNNs) have shown promise in integrating protein-protein interaction (PPI) networks for identifying cancer genes in recent studies. However, due to the insufficient modeling of the biological information in PPI networks, more faithfully depiction of complex protein interaction patterns for cancer genes within the graph structure remains largely unexplored. This study takes a pioneering step toward bridging biological anomalies in protein interactions caused by cancer genes to statistical graph anomaly. We find a unique graph anomaly exhibited by cancer genes, namely weight heterogeneity, which manifests as significantly higher variance in edge weights of cancer gene nodes within the graph. Additionally, from the spectral perspective, we demonstrate that the weight heterogeneity could lead to the "flattening out" of spectral energy, with a concentration towards the extremes of the spectrum. Building on these insights, we propose the HIerarchical-Perspective Graph Neural Network (HIPGNN) that not only determines spectral energy distribution variations on the spectral perspective, but also perceives detailed protein interaction context on the spatial perspective. Extensive experiments are conducted on two reprocessed datasets STRINGdb and CPDB, and the experimental results demonstrate the superiority of HIPGNN.

Downloads

Published

2025-04-11

How to Cite

Zang, Y., Ren, L., Li, Y., Wang, Z., Selby, D. A., Wang, Z., … Wu, J. (2025). Rethinking Cancer Gene Identification Through Graph Anomaly Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 13161–13169. https://doi.org/10.1609/aaai.v39i12.33436

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II