On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

Authors

  • Xiaobao Wu Nanyang Technological University, Singapore
  • Fengjun Pan Nanyang Technological University, Singapore
  • Thong Nguyen National University of Singapore, Singapore
  • Yichao Feng Nanyang Technological University, Singapore
  • Chaoqun Liu Nanyang Technological University, Singapore; DAMO Academy, Alibaba Group, Singapore
  • Cong-Duy Nguyen Nanyang Technological University, Singapore
  • Anh Tuan Luu Nanyang Technological University, Singapore

DOI:

https://doi.org/10.1609/aaai.v38i17.29895

Keywords:

NLP: Text Classification, NLP: Interpretability, Analysis, and Evaluation of NLP Models, NLP: Applications

Abstract

Hierarchical topic modeling aims to discover latent topics from a corpus and organize them into a hierarchy, so that documents can be understood at the desired semantic granularity. However, existing methods tend to produce topic hierarchies with low affinity, rationality, and diversity, which hampers document understanding. To overcome these challenges, we propose the Transport Plan and Context-aware Hierarchical Topic Model (TraCo). In place of the simple topic dependencies used in earlier work, we propose a transport plan dependency method. It constrains dependencies to ensure their sparsity and balance, and uses them to regularize the building of the topic hierarchy. This improves the affinity and diversity of hierarchies. We further propose a context-aware disentangled decoder. Unlike the entangled decoding of previous work, it assigns different semantic granularity to topics at different levels through disentangled decoding. This improves the rationality of hierarchies. Experiments on benchmark datasets demonstrate that our method surpasses state-of-the-art baselines, effectively improving the affinity, rationality, and diversity of hierarchical topic modeling and achieving better performance on downstream tasks.
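
The abstract does not say how the transport plan between topics is computed. As a rough illustration only, the sketch below assumes an entropic-regularized optimal transport formulation solved with Sinkhorn iterations, where the cost is the squared distance between topic embeddings. The embeddings, the uniform marginals, and the names `sinkhorn_plan`, `epsilon`, and `n_iters` are all hypothetical and are not taken from the paper. It shows how fixing the row and column masses yields a plan that is balanced across parent topics while concentrating each child topic on its nearest parent.

```python
import math


def sinkhorn_plan(cost, row_mass, col_mass, epsilon=0.1, n_iters=500):
    """Entropic-regularized transport plan (illustrative, not the paper's code).

    cost[i][j] is the cost of linking child topic i to parent topic j.
    row_mass and col_mass are the prescribed marginals of the plan.
    """
    n, m = len(cost), len(cost[0])
    # Gibbs kernel: low-cost pairs receive large kernel values.
    kernel = [[math.exp(-cost[i][j] / epsilon) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Alternately rescale rows and columns to match the marginals.
        u = [row_mass[i] / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [col_mass[j] / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Hypothetical 2-D embeddings: four child topics, two parent topics.
child_topics = [[0.0, 0.1], [0.1, 0.0], [1.0, 0.9], [0.9, 1.0]]
parent_topics = [[0.0, 0.0], [1.0, 1.0]]

cost = [[squared_distance(c, p) for p in parent_topics] for c in child_topics]
row_mass = [1.0 / len(child_topics)] * len(child_topics)    # each child carries equal mass
col_mass = [1.0 / len(parent_topics)] * len(parent_topics)  # each parent receives equal mass

plan = sinkhorn_plan(cost, row_mass, col_mass)
for row in plan:
    print(["%.4f" % value for value in row])
```

In this toy setting the column marginals keep the two parents equally loaded (balance), and a small `epsilon` pushes nearly all of each child's mass onto a single parent (sparsity). A larger `epsilon` would spread the mass more evenly.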

Published

2024-03-24

How to Cite

Wu, X., Pan, F., Nguyen, T., Feng, Y., Liu, C., Nguyen, C.-D., & Luu, A. T. (2024). On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19261-19269. https://doi.org/10.1609/aaai.v38i17.29895

Section

AAAI Technical Track on Natural Language Processing II