Neural Dynamic Focused Topic Model

Authors

  • Kostadin Cvejoski Fraunhofer IAIS
  • Ramsés J. Sánchez BIT University of Bonn
  • César Ojeda University of Potsdam

DOI:

https://doi.org/10.1609/aaai.v37i11.26496

Keywords:

SNLP: Text Mining, ML: Deep Generative Models & Autoencoders, ML: Probabilistic Methods, SNLP: Applications

Abstract

Topic models and all their variants analyse text by learning meaningful representations through word co-occurrences. As pointed out by previous work, such models implicitly assume that the probability of a topic to be active and its proportion within each document are positively correlated. This correlation can be strongly detrimental in the case of documents created over time, simply because recent documents are likely better described by new and hence rare topics. In this work we leverage recent advances in neural variational inference and present an alternative neural approach to the dynamic Focused Topic Model. Indeed, we develop a neural model for topic evolution which exploits sequences of Bernoulli random variables in order to track the appearances of topics, thereby decoupling their activities from their proportions. We evaluate our model on three different datasets (the UN general debates, the collection of NeurIPS papers, and the ACL Anthology dataset) and show that it (i) outperforms state-of-the-art topic models in generalization tasks and (ii) performs comparably to them on prediction tasks, while employing roughly the same number of parameters, and converging about two times faster.

Downloads

Published

2023-06-26

How to Cite

Cvejoski, K., Sánchez, R. J., & Ojeda, C. (2023). Neural Dynamic Focused Topic Model. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 12719-12727. https://doi.org/10.1609/aaai.v37i11.26496

Issue

Section

AAAI Technical Track on Speech & Natural Language Processing