Wasserstein Graph Distance Based on L1–Approximated Tree Edit Distance between Weisfeiler–Lehman Subtrees

Authors

  • Zhongxi Fang, WASEDA University
  • Jianming Huang, WASEDA University
  • Xun Su, WASEDA University
  • Hiroyuki Kasai, WASEDA University

DOI:

https://doi.org/10.1609/aaai.v37i6.25916

Keywords:

ML: Graph-based Machine Learning, DMKM: Graph Mining, Social Network Analysis & Community Mining

Abstract

The Weisfeiler-Lehman (WL) test is widely used throughout graph machine learning, including in graph kernels, graph metrics, and graph neural networks. However, it considers only the consistency of graphs, that is, whether structures match or not, so it cannot detect slight structural differences. This limits the structural information it can capture and, in turn, the performance of existing models that rely on it. The limitation is particularly severe for traditional metrics defined by the WL test. In this paper, we propose a novel graph metric, the Wasserstein WL Subtree (WWLS) distance, to address this problem. Our approach uses the WL subtree as the structural information of a node's neighborhood and defines a node metric as the L1-approximated tree edit distance (L1-TED) between the WL subtrees of nodes. We then combine the Wasserstein distance with the L1-TED to define the WWLS distance, which can capture slight structural differences that are difficult to detect with conventional metrics. We demonstrate that the proposed WWLS distance outperforms baselines in both metric validation and graph classification experiments.
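
The limitation described in the abstract can be illustrated with a minimal sketch of WL label refinement. This is not the authors' code, and it does not implement L1-TED or the WWLS distance. The function names `wl_labels` and `wl_histogram` and the two toy graphs are hypothetical, and nested tuples stand in for the hashing step that WL implementations usually apply. The sketch only shows that WL labels support an equal-or-not comparison: two nodes whose neighborhoods differ by a single edge simply receive different labels, with no indication of how small the difference is.

```python
from collections import Counter


def wl_labels(adj, labels, iterations):
    """Return per-node WL labels after `iterations` refinement rounds.

    adj:    dict mapping node -> list of neighbouring nodes
    labels: dict mapping node -> initial label
    """
    current = dict(labels)
    for _ in range(iterations):
        # New label = (own label, sorted multiset of neighbour labels).
        # Sorting by repr gives a deterministic, order-independent result.
        current = {
            v: (current[v], tuple(sorted((current[u] for u in adj[v]), key=repr)))
            for v in adj
        }
    return current


def wl_histogram(adj, labels, iterations):
    """Count how often each refined label occurs in the graph."""
    return Counter(wl_labels(adj, labels, iterations).values())


# Toy graphs (assumed for illustration): a 3-node path, and the same path
# with one extra leaf attached to the middle node.
path = {0: [1], 1: [0, 2], 2: [1]}
star = {0: [1], 1: [0, 2, 3], 2: [1], 3: [1]}
init_path = {v: "a" for v in path}
init_star = {v: "a" for v in star}

path_1 = wl_labels(path, init_path, 1)
star_1 = wl_labels(star, init_star, 1)

# After one round the leaves still match, but the middle nodes do not.
print(path_1[0] == star_1[0])
print(path_1[1] == star_1[1])

# After two rounds even the leaves differ, because the change propagates.
print(wl_labels(path, init_path, 2)[0] == wl_labels(star, init_star, 2)[0])
```

A metric built on label equality treats the one-edge difference between the two middle nodes the same as any larger difference. A node metric defined on the WL subtrees themselves, as the abstract proposes, is meant to grade such differences instead.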

Published

2023-06-26

How to Cite

Fang, Z., Huang, J., Su, X., & Kasai, H. (2023). Wasserstein Graph Distance Based on L1–Approximated Tree Edit Distance between Weisfeiler–Lehman Subtrees. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 7539-7549. https://doi.org/10.1609/aaai.v37i6.25916

Section

AAAI Technical Track on Machine Learning I