Spatial Graph Attention Network Modeling for Neighborhood-Scale Lead Contamination Risk Prediction Using Publicly Available Data

Raphael Anaadumba; Nazim A.Belabbaci; Connor Sullivan; Anton Kovalev; Yidong Zhu; Pradeep Kurup; Mohammad Arif Ul Alam

doi:10.1609/aaai.v40i47.41455

Authors

Raphael Anaadumba Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
Nazim A.Belabbaci Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
Connor Sullivan Civil and Environmental Engineering, University of Massachusetts Lowell, Lowell, USA
Anton Kovalev Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
Yidong Zhu Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA
Pradeep Kurup Civil and Environmental Engineering, University of Massachusetts Lowell, Lowell, USA
Mohammad Arif Ul Alam Richard A. Miner School of Computer and Information Sciences, University of Massachusetts Lowell, Lowell, USA Department of Medicine, University of Massachusetts Chan Medical School, Worcester, USA National Institute of Aging, National Institute of Health, Bethesda, USA

DOI:

https://doi.org/10.1609/aaai.v40i47.41455

Abstract

Lead contamination in urban water systems remains a prevalent public health threat, affecting millions of American households and disproportionately endangering vulnerable population groups. Current municipal risk assessment and inspection strategies are overwhelmingly based on random sampling and complaint-driven protocols that overlook spatial complexity, reinforce inequities, and squander limited resources, leaving critical exposure areas unidentified. This paper presents a lead contamination risk prediction framework from socio-demographic housing features analytics, first of its kind, by drawing on partially anonymized residential testing data as ground truth and applying graph neural networks alongside gradient-boosted ensembles. Specifically, our method integrates spatial Deep Graph Attention Networks classifiers to capture inter-neighborhood contamination dependencies, fuse demographic and spatial evidence, and produce interpretable risk scores. Those scores are actionable by municipal water authorities at the intra-neighborhood level. Through extensive experiments on newly constructed Chicago block-group level datasets, our framework achieves a balanced accuracy of 84.8% and reduces false positive lead contamination by up to 44% versus spatial-only baselines and 21% over current practice, without sacrificing recall on contaminated blocks. Our approach not only extends technical boundaries in spatial-ensemble learning and privacy-preserving urban health modeling, but also provides policymakers and public health officials with a means to assess and address contamination risks, supporting efforts to protect community health and safety.

Spatial Graph Attention Network Modeling for Neighborhood-Scale Lead Contamination Risk Prediction Using Publicly Available Data

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information