A General Clustering Agreement Index: For Comparing Disjoint and Overlapping Clusters

Authors

  • Reihaneh Rabbany University of Alberta
  • Osmar Zaïane University of Alberta

DOI:

https://doi.org/10.1609/aaai.v31i1.10905

Keywords:

Clustering Agreement, Overlapping Clusters, Cluster Evaluation, Cluster Validation, Community Detection

Abstract

A clustering agreement index quantifies the similarity between two given clusterings. It is most commonly used to compare the results obtained from different clustering algorithms against the ground-truth clustering in the benchmark datasets. In this paper, we present a general Clustering Agreement Index (CAI) for comparing disjoint and overlapping clusterings. CAI is generic and introduces a family of clustering agreement indexes. In particular, the two widely used indexes of Adjusted Rand Index (ARI), and Normalized Mutual Information (NMI), are special cases of the CAI. Our index, therefore, provides overlapping extensions for both these commonly used indexes, whereas their original formulations are only defined for disjoint cases. Lastly, unlike previous indexes, CAI is flexible and can be adapted to incorporate the structure of the data, which is important when comparing clusters in networks, a.k.a communities.

Downloads

Published

2017-02-13

How to Cite

Rabbany, R., & Zaïane, O. (2017). A General Clustering Agreement Index: For Comparing Disjoint and Overlapping Clusters. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10905