Veli: Unsupervised Method and Unified Benchmark for Low-Cost Air Quality Sensor Correction

Authors

  • Yahia Dalbah University of Amsterdam
  • Marcel Worring University of Amsterdam
  • Yen-Chia Hsu University of Amsterdam

DOI:

https://doi.org/10.1609/aaai.v40i25.39206

Abstract

Urban air pollution is a major health crisis causing millions of premature deaths annually, underscoring the urgent need for accurate and scalable monitoring of air quality (AQ). While low-cost sensors (LCS) offer a scalable alternative to expensive reference-grade stations, their readings are affected by drift, calibration errors, and environmental interference. To address these challenges, we introduce Veli (Reference free Variational Estimation via Latent Inference), an unsupervised Bayesian model that leverages variational inference to correct LCS readings without requiring co-location with reference stations, eliminating a major deployment barrier. Specifically, Veli constructs a disentangled representation of the LCS readings, effectively separating the true pollutant reading from the sensor noise. To build our model and address the lack of standardized benchmarks in AQ monitoring, we also introduce the Air Quality Sensor Data Repository (AQ-SDR). AQ-SDR is the largest AQ sensor benchmark to date, with readings from 23,737 LCS and reference stations across multiple regions. Veli demonstrates strong generalization across both in-distribution and out-of-distribution settings, effectively handling sensor drift and erratic sensor behavior. Appendices are available in the extended version.

Downloads

Published

2026-03-14

How to Cite

Dalbah, Y., Worring, M., & Hsu, Y.-C. (2026). Veli: Unsupervised Method and Unified Benchmark for Low-Cost Air Quality Sensor Correction. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 20684–20692. https://doi.org/10.1609/aaai.v40i25.39206

Issue

Section

AAAI Technical Track on Machine Learning II