Spatial Graph Attention Network Modeling for Neighborhood-Scale Lead Contamination Risk Prediction Using Publicly Available Data

Authors

  • Raphael Anaadumba Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
  • Nazim A.Belabbaci Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
  • Connor Sullivan Civil and Environmental Engineering, University of Massachusetts Lowell, Lowell, USA
  • Anton Kovalev Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
  • Yidong Zhu Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
  • Pradeep Kurup Civil and Environmental Engineering, University of Massachusetts Lowell, Lowell, USA
  • Mohammad Arif Ul Alam Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA Department of Medicine, University of Massachusetts Chan Medical School, Worcester, USA National Institute of Aging, National Institute of Health, Bethesda, USA

DOI:

https://doi.org/10.1609/aaai.v40i47.41455

Abstract

Lead contamination in urban water systems remains a prevalent public health threat, affecting millions of American households and disproportionately endangering vulnerable population groups. Current municipal risk assessment and inspection strategies are overwhelmingly based on random sampling and complaint-driven protocols that overlook spatial complexity, reinforce inequities, and squander limited resources, leaving critical exposure areas unidentified. This paper presents a lead contamination risk prediction framework from socio-demographic housing features analytics, first of its kind, by drawing on partially anonymized residential testing data as ground truth and applying graph neural networks alongside gradient-boosted ensembles. Specifically, our method integrates spatial Deep Graph Attention Networks classifiers to capture inter-neighborhood contamination dependencies, fuse demographic and spatial evidence, and produce interpretable risk scores. Those scores are actionable by municipal water authorities at the intra-neighborhood level. Through extensive experiments on newly constructed Chicago block-group level datasets, our framework achieves a balanced accuracy of 84.8% and reduces false positive lead contamination by up to 44% versus spatial-only baselines and 21% over current practice, without sacrificing recall on contaminated blocks. Our approach not only extends technical boundaries in spatial-ensemble learning and privacy-preserving urban health modeling, but also provides policymakers and public health officials with a means to assess and address contamination risks, supporting efforts to protect community health and safety.

Published

2026-03-14

How to Cite

Anaadumba, R., A.Belabbaci, N., Sullivan, C., Kovalev, A., Zhu, Y., Kurup, P., & Alam, M. A. U. (2026). Spatial Graph Attention Network Modeling for Neighborhood-Scale Lead Contamination Risk Prediction Using Publicly Available Data. Proceedings of the AAAI Conference on Artificial Intelligence, 40(47), 40193–40201. https://doi.org/10.1609/aaai.v40i47.41455

Issue

Section

IAAI Technical Track on Emerging Applications of AI