Towards Trustworthy Autonomous Systems via Conversations and Explanations


  • Balint Gyevnar University of Edinburgh



Trustworthy Autonomous Systems, Social Explainable AI, Multi-agent Systems, Human-centric AI, AI Regulation


Autonomous systems fulfil an increasingly important role in our societies, however, AI-powered systems have seen less success over the years, as they are expected to tackle a range of social, legal, or technological challenges and modern neural network-based AI systems cannot yet provide guarantees to many of these challenges. Particularly important is that these systems are black box decision makers, eroding human oversight, contestation, and agency. To address this particular concern, my thesis focuses on integrating social explainable AI with cognitive methods and natural language processing to shed light on the internal processes of autonomous systems in a way accessible to lay users. I propose a causal explanation generation model for decision-making called CEMA based on counterfactual simulations in multi-agent systems. I also plan to integrate CEMA with a broader natural language processing pipeline to support targeted and personalised explanations that address people's cognitive biases. I hope that my research will have a positive impact on the public acceptance of autonomous agents by building towards more trustworthy AI.




How to Cite

Gyevnar, B. (2024). Towards Trustworthy Autonomous Systems via Conversations and Explanations. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23389-23390.