Unidimensional Clustering of Discrete Data Using Latent Tree Models

Authors

  • April Liu The Hong Kong University of Science and Technology
  • Leonard Poon The Hong Kong Institute of Education
  • Nevin Zhang The Hong Kong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v29i1.9593

Keywords:

Clustering, Latent Tree Models, Local Independence

Abstract

This paper is concerned with model-based clustering of discrete data. Latent class models (LCMs) are usually used for the task. An LCM consists of a latent variable and a number of attributes. It makes the overly restrictive assumption that the attributes are mutually independent given the latent variable. We propose a novel method to relax the assumption. The key idea is to partition the attributes into groups such that correlations among the attributes in each group can be properly modeled by using one single latent variable. The latent variables for the attribute groups are then used to build a number of models and one of them is chosen to produce the clustering results. Extensive empirical studies have been conducted to compare the new method with LCM and several other methods (K-means, kernel K-means and spectral clustering) that are not model-based. The new method outperforms the alternative methods in most cases and the differences are often large.

Downloads

Published

2015-02-21

How to Cite

Liu, A., Poon, L., & Zhang, N. (2015). Unidimensional Clustering of Discrete Data Using Latent Tree Models. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9593

Issue

Section

Main Track: Novel Machine Learning Algorithms