Methods to Determine Node Centrality and Clustering in Graphs with Uncertain Structure

Authors

  • Joseph Pfeiffer Purdue University
  • Jennifer Neville Purdue University

Abstract

Much of the past work in network analysis has focused on analyzing discrete graphs, where binary edges represent the “presence” or “absence” of a relationship. Since traditional network measures (e.g., betweenness centrality) assume a discrete link structure, data about complex systems must be transformed to this representation before calculating network properties. In many domains where there may be uncertainty about the relationship structure, this transformation to a discrete representation will result in a lose of information. In order to represent and reason with network uncertainty, we move beyond the discrete graph framework and develop social network measures based on a probabilistic graph representation. More specifically, we develop measures of path length, betweenness centrality, and clustering coefficient— one set based on sampling and one based on probabilistic paths. We evaluate our methods on two real-world networks, Enron and Facebook, showing that our proposed methods more accurately capture salient effects without being susceptible to local noise.

Downloads

Published

2021-08-03

How to Cite

Pfeiffer, J., & Neville, J. (2021). Methods to Determine Node Centrality and Clustering in Graphs with Uncertain Structure. Proceedings of the International AAAI Conference on Web and Social Media, 5(1), 590-593. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/14187