Predicting Disease Transmission from Geo-Tagged Micro-Blog Data

Authors

  • Adam Sadilek University of Rochester
  • Henry Kautz University of Rochester
  • Vincent Silenzio University of Rochester

DOI:

https://doi.org/10.1609/aaai.v26i1.8103

Keywords:

machine learning, class imbalance, location-based reasoning, text classification, disease spread, public health

Abstract

Researchers have begun to mine social network data in order to predict a variety of social, economic, and health related phenomena. While previous work has focused on predicting aggregate properties, such as the prevalence of seasonal influenza in a given country, we consider the task of fine-grained prediction of the health of specific people from noisy and incomplete data. We construct a probabilistic model that can predict if and when an individual will fall ill with high precision and good recall on the basis of his social ties and co-locations with other people, as revealed by their Twitter posts. Our model is highly scalable and can be used to predict general dynamic properties of individuals in large real-world social networks. These results provide a foundation for research on fundamental questions of public health, including the identification of non-cooperative disease carriers ("Typhoid Marys"), adaptive vaccination policies, and our understanding of the emergence of global epidemics from day-to-day interpersonal interactions.

Downloads

Published

2021-09-20

How to Cite

Sadilek, A., Kautz, H., & Silenzio, V. (2021). Predicting Disease Transmission from Geo-Tagged Micro-Blog Data. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 136-142. https://doi.org/10.1609/aaai.v26i1.8103