Analyzing and Detecting Opinion Spam on a Large-scale Dataset via Temporal and Spatial Patterns

Authors

  • Huayi Li University of Illinois at Chicago
  • Zhiyuan Chen University of Illinois at Chicago
  • Arjun Mukherjee University of Houston
  • Bing Liu University of Illinois at Chicago
  • Jidong Shao Dianping Inc.

DOI:

https://doi.org/10.1609/icwsm.v9i1.14652

Keywords:

Spam Detection, Opinion Spam

Abstract

Although opinion spam (or fake review) detection has attracted significant research attention in recent years, the problem is far from solved. One key reason is that there is no large-scale ground truth labeled dataset available for model building. Some review hosting sites such as Yelp.com and Dianping.com have built fake review filtering systems to ensure the quality of their reviews, but their algorithms are trade secrets. Working with Dianping, we present the first large-scale analysis of restaurant reviews filtered by Dianping's fake review filtering system. Along with the analysis, we also propose some novel temporal and spatial features for supervised opinion spam detection. Our results show that these features significantly outperform existing state-of-art features.

Downloads

Published

2021-08-03

How to Cite

Li, H., Chen, Z., Mukherjee, A., Liu, B., & Shao, J. (2021). Analyzing and Detecting Opinion Spam on a Large-scale Dataset via Temporal and Spatial Patterns. Proceedings of the International AAAI Conference on Web and Social Media, 9(1), 634-637. https://doi.org/10.1609/icwsm.v9i1.14652