Revisiting the Trolley Problem for AI: Biases and Stereotypes in Large Language Models and their Impact on Ethical Decision-Making

Sahan Hatemo; Christof Weickhardt; Luca Gisler; Oliver Bendel

doi:10.1609/aaaiss.v5i1.35590

Revisiting the Trolley Problem for AI: Biases and Stereotypes in Large Language Models and their Impact on Ethical Decision-Making

Authors

Sahan Hatemo FHNW School of Computer Science
Christof Weickhardt FHNW School of Computer Science
Luca Gisler FHNW School of Computer Science
Oliver Bendel FHNW School of Business

DOI:

https://doi.org/10.1609/aaaiss.v5i1.35590

Abstract

The trolley problem has long served as a lens for exploring moral decision-making, now gaining renewed significance in the context of artificial intelligence (AI). This study investigates ethical reasoning in three open-source large language models (LLMs)—LLaMA, Mistral and Qwen—through variants of the trolley problem. By introducing demographic prompts (age, nationality and gender) into three scenarios (switch, loop and footbridge), we systematically evaluate LLM responses against human survey data from the Moral Machine experiment. Our findings reveal notable differences: Mistral exhibits a consistent tendency to overintervene, while Qwen chooses to intervene less and LLaMA balances between the two. Notably demographic attributes, particularly nationality, significantly influence LLM decisions, exposing potential biases in AI ethical reasoning. These insights underscore the necessity of refining LLMs to ensure fairness and ethical alignment, leading the way for more trustworthy AI systems.

AAAI Summer Symposium 2025 Proceedings Cover

Downloads

Published

2025-05-28

How to Cite

Hatemo, S., Weickhardt, C., Gisler, L., & Bendel, O. (2025). Revisiting the Trolley Problem for AI: Biases and Stereotypes in Large Language Models and their Impact on Ethical Decision-Making. Proceedings of the AAAI Symposium Series, 5(1), 213–219. https://doi.org/10.1609/aaaiss.v5i1.35590

Download Citation

Issue

Vol. 5 No. 1: Proceedings of the 2025 AAAI Spring Symposium Series

Section

Human-Compatible AI for Well-being (Full Papers)

Revisiting the Trolley Problem for AI: Biases and Stereotypes in Large Language Models and their Impact on Ethical Decision-Making

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information