Ranking Tweets by Labeled and Collaboratively Selected Pairs with Transitive Closure

Shenghua Liu; Xueqi Cheng; Fangtao Li

doi:10.1609/aaai.v28i1.8896

Authors

Shenghua Liu Chinese Academy of Sciences
Xueqi Cheng Chinese Academy of Sciences
Fangtao Li Google Inc.

DOI:

https://doi.org/10.1609/aaai.v28i1.8896

Keywords:

Microblog search, ranking tweets, co-training, semi-supervised learning, transitive closure

Abstract

Tweets ranking is important for information acquisition in Microblog. Due to the content sparsity and lackof labeled data, it is better to employ semi-supervisedlearning methods to utilize the unlabeled data. However,most of previous semi-supervised learning methods donot consider the pair conflict problem, which means thatthe new selected unlabeled data may conflict with the labeled and previously selected data. It will hurt the learning performance a lot, if the training data contains manyconflict pairs. In this paper, we propose a new collaborative semi-supervised SVM ranking model (CSR-TC)with consideration of the order conflict. The unlabeleddata is selected based on a dynamically maintained transitive closure graph to avoid pair conflict. We also investigate the two views of features, intrinsic and contentrelevant features, for the proposed model. Extensive experiments are conducted on TREC Microblogging corpus. The results demonstrate that our proposed methodachieves significant improvement, compared to severalstate-of-the-art models.

Ranking Tweets by Labeled and Collaboratively Selected Pairs with Transitive Closure

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription