A Probabilistic Model for Bursty Topic Discovery in Microblogs

Authors

  • Xiaohui Yan, Institute of Computing Technology, Chinese Academy of Sciences
  • Jiafeng Guo, Institute of Computing Technology, Chinese Academy of Sciences
  • Yanyan Lan, Institute of Computing Technology, Chinese Academy of Sciences
  • Jun Xu, Institute of Computing Technology, Chinese Academy of Sciences
  • Xueqi Cheng, Institute of Computing Technology, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v29i1.9199

Keywords:

short text, topic model, text mining, bursty topic, event detection

Abstract

Bursty topic discovery in microblogs is important for helping people grasp essential and valuable information. However, the task is challenging because microblog posts are particularly short and noisy. This work develops a novel probabilistic model, the Bursty Biterm Topic Model (BBTM), to address the task. BBTM extends the Biterm Topic Model (BTM) by incorporating the burstiness of biterms as prior knowledge for bursty topic modeling, which gives it two merits: 1) like BTM, it handles the data sparsity problem in topic modeling over short texts well; 2) it can automatically discover high-quality bursty topics in microblogs in a principled and efficient way. Extensive experiments on a standard Twitter dataset show that our approach significantly outperforms the state-of-the-art baselines.
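
For readers unfamiliar with the term, a biterm in BTM is an unordered pair of words that co-occur in the same short text. The following is a minimal sketch of extracting and counting biterms from tokenized posts. It is not the authors' implementation: the function names `extract_biterms` and `count_biterms` and the toy posts are hypothetical, and the abstract does not specify how BBTM turns such counts into a burstiness prior, so that step is not shown.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return the unordered word pairs (biterms) co-occurring in one short post."""
    # Sort each pair so that ("a", "b") and ("b", "a") count as the same biterm.
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


def count_biterms(posts):
    """Count biterm occurrences over a collection of tokenized posts."""
    counts = Counter()
    for tokens in posts:
        counts.update(extract_biterms(tokens))
    return counts


# Toy, made-up posts (already tokenized) used only for illustration.
posts = [["earthquake", "japan", "tsunami"], ["japan", "earthquake"]]
biterm_counts = count_biterms(posts)
```

Counting word pairs over the whole collection, rather than words within each post, is what lets a biterm-based model draw on corpus-level co-occurrence statistics even when individual posts are very short.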

Published

2015-02-09

How to Cite

Yan, X., Guo, J., Lan, Y., Xu, J., & Cheng, X. (2015). A Probabilistic Model for Bursty Topic Discovery in Microblogs. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9199