On Forecasting Lags in AI Risk Evaluation

Authors

  • Paolo Bova Teesside University

DOI:

https://doi.org/10.1609/aies.v8i3.36768

Abstract

AI risk evaluations are key to building safety cases for new AI systems. These evaluations help us track how quickly new AI systems are gaining competencies and propensities to engage in harmful behavior. However, it is not currently clear how well we are doing in covering the surface of potential risks that AI systems could present as they continue to improve. This paper proposes a modelling framework for predicting lags in AI risk evaluations, illustrating our approach using data from METR's evaluations of the ability for AI to complete long tasks. We find in our simulations that a business-as-usual approach to tracking AI risks is vulnerable to substantial detection lags when facing budget constraints. We advise greater adoption of the logistic estimator proposed by METR where appropriate, as it mitigates against such lags.

Downloads

Published

2025-10-15

How to Cite

Bova, P. (2025). On Forecasting Lags in AI Risk Evaluation. Proceedings of the AAAI ACM Conference on AI, Ethics, and Society, 8(3), 2850–2852. https://doi.org/10.1609/aies.v8i3.36768