From AI Principles to AI Assurance: an Online Safety Case Study

Authors

  • Miranda Cross UK Office of Communications
  • Andreas Gutmann Office of Communications, University College London
  • Ismini Psychoula Office of Communications, University College London
  • Pedro Friere Aston University

DOI:

https://doi.org/10.1609/aaaiss.v7i1.36860

Abstract

Principles-based frameworks for AI assurance have been proposed for various AI/ML use cases, focusing on aspects such as ethical design, trustworthiness, and safety. However, translating these high-level principles into actionable, objective criteria for auditing, particularly by third parties, remains challenging. Our analysis shows this is due to the inherent subjectivity of principles, the need for vertical frameworks tailored to specific AI/ML applications, and the unreliability of information gathered during the assurance process. In this paper, we present a case study on how to develop and operationalise a principles-based framework for AI assurance aimed at assessing the ‘accuracy’ of child sexual exploitation (CSEA) and terrorism detection technologies in the context of online safety. The proposed assurance framework addresses a requirement in the UK's 2023 Online Safety Act to create an 'accreditation' scheme specifically for CSEA and terrorism detection technologies. We discuss the critical challenges for operationalising such principles-based frameworks for assurance, particularly in relation to ensuring transparency, reliability, and consistency in audits. We also map potential issues which remain for effectively assessing and auditing AI/ML technologies, informing the development of future research agendas which further research and development of robust standards for assurance, particularly in sociotechnical contexts.

Downloads

Published

2025-11-23

How to Cite

Cross, M., Gutmann, A., Psychoula, I., & Friere, P. (2025). From AI Principles to AI Assurance: an Online Safety Case Study. Proceedings of the AAAI Symposium Series, 7(1), 2–10. https://doi.org/10.1609/aaaiss.v7i1.36860

Issue

Section

AI for Social Good: Emerging Methods, Measures, Data, and Ethics