Progressive EM for Latent Tree Models and Hierarchical Topic Detection

Authors

  • Peixian Chen The Hong Kong University of Science and Technology
  • Nevin Zhang The Hong Kong University of Science and Technology
  • Leonard Poon The Hong Kong Institute of Education
  • Zhourong Chen The Hong Kong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v30i1.10196

Abstract

Hierarchical latent tree analysis (HLTA) is recently proposed as a new method for topic detection. It differs fundamentally from the LDA-based methods in terms of topic definition, topic-document relationship, and learning method. It has been shown to discover significantly more coherent topics and better topic hierarchies. However, HLTA relies on the Expectation-Maximization (EM) algorithm for parameter estimation and hence is not efficient enough to deal with large datasets. In this paper, we propose a method to drastically speed up HLTA using a technique inspired by the advances in the method of moments. Empirical experiments show that our method greatly improves the efficiency of HLTA. It is as efficient as the state-of-the-art LDA-based method for hierarchical topic detection and finds substantially better topics and topic hierarchies.

Downloads

Published

2016-02-21

How to Cite

Chen, P., Zhang, N., Poon, L., & Chen, Z. (2016). Progressive EM for Latent Tree Models and Hierarchical Topic Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10196

Issue

Section

Technical Papers: Machine Learning Methods