Behavioral Experiments in Email Filter Evasion

Authors

  • Liyiming Ke Vanderbilt University
  • Bo Li Vanderbilt University
  • Yevgeniy Vorobeychik Vanderbilt University

DOI:

https://doi.org/10.1609/aaai.v30i1.10061

Keywords:

Adversarial classification, game theory, evasion classification

Abstract

Despite decades of effort to combat spam, unwanted and even malicious emails, such as phish which aim to deceive recipients into disclosing sensitive information, still routinely find their way into one's mailbox.To be sure, email filters manage to stop a large fraction of spam emails from ever reaching users, but spammers and phishers have mastered the art of filter evasion, or manipulating the content of email messages to avoid being filtered.We present a unique behavioral experiment designed to study email filter evasion.Our experiment is framed in somewhat broader terms: given the widespread use of machine learning methods for distinguishing spam and non-spam, we investigate how human subjects manipulate a spam template to evade a classification-based filter.We find that adding a small amount of noise to a filter significantly reduces the ability of subjects to evade it, observing that noise does not merely have a short-term impact, but also degrades evasion performance in the longer term.Moreover, we find that greater coverage of an email template by the classifier (filter) features significantly increases the difficulty of evading it.This observation suggests that aggressive feature reduction — a common practice in applied machine learning — can actually facilitate evasion.In addition to the descriptive analysis of behavior, we develop a synthetic model of human evasion behavior which closely matches observed behavior and effectively replicates experimental findings in simulation.

Downloads

Published

2016-02-21

How to Cite

Ke, L., Li, B., & Vorobeychik, Y. (2016). Behavioral Experiments in Email Filter Evasion. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10061

Issue

Section

Technical Papers: Human-Computation and Crowd Sourcing