GoT: a Growing Tree Model for Clustering Ensemble


  • Feijiang Li Shanxi University
  • Yuhua Qian Shanxi University
  • Jieting Wang Shanxi University



Clustering, Ensemble Methods


The clustering ensemble technique that integrates multiple clustering results can improve the accuracy and robustness of the final clustering. In many clustering ensemble algorithms, the co-association matrix (CA matrix), which reflects the frequency of any two samples being partitioned into the same cluster, plays an important role. However, generally, the CA matrix is highly sparse with low value density, which may limit the performance of an algorithm based on it. To handle these issues, in this paper, we propose a growing tree model (GoT). In this model, the CA matrix is firstly refined by the shortest path technique so that its sparsity will be mitigated. Then, a set of representative prototype examples is discovered. Finally, to handle the low value density of the CA matrix, the prototypes gradually connect to their neighborhood, which likes a set of trees growing up. The rationality of the discovered prototype examples is illustrated by theoretical analysis and experimental analysis. The working mechanism of the GoT is visually shown on synthetic data sets. Experimental analyses on eight UCI data sets and eight image data sets show that the GoT outperforms nine representative clustering ensemble algorithms.




How to Cite

Li, F., Qian, Y., & Wang, J. (2021). GoT: a Growing Tree Model for Clustering Ensemble. Proceedings of the AAAI Conference on Artificial Intelligence, 35(9), 8349-8356.



AAAI Technical Track on Machine Learning II