Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making

Hatemo, Sahan; Weickhardt, Christof; Gisler, Luca; Bendel, Oliver

Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making

dc.contributor.author	Hatemo, Sahan
dc.contributor.author	Weickhardt, Christof
dc.contributor.author	Gisler, Luca
dc.contributor.author	Bendel, Oliver
dc.contributor.editor	Petrick, Ron
dc.contributor.editor	Geib, Christopher
dc.date.accessioned	2025-07-28T13:04:55Z
dc.date.issued	2025
dc.description.abstract	The trolley problem has long served as a lens for exploring moral decision-making, now gaining renewed significance in the context of artificial intelligence (AI). This study investigates ethical reasoning in three open-source large language models (LLMs)—LLaMA, Mistral and Qwen—through variants of the trolley problem. By introducing demographic prompts (age, nationality and gender) into three scenarios (switch, loop and footbridge), we systematically evaluate LLM responses against human survey data from the Moral Machine experiment. Our findings reveal notable differences: Mistral exhibits a consistent tendency to overintervene, while Qwen chooses to intervene less and LLaMA balances between the two. Notably demographic attributes, particularly nationality, significantly influence LLM decisions, exposing potential biases in AI ethical reasoning. These insights underscore the necessity of refining LLMs to ensure fairness and ethical alignment, leading the way for more trustworthy AI systems.
dc.event	2025 AAAI Spring Symposium
dc.event.end	2025-04-02
dc.event.start	2025-03-31
dc.identifier.doi	10.1609/aaaiss.v5i1.35590
dc.identifier.uri	https://irf.fhnw.ch/handle/11654/52099
dc.language.iso	en
dc.publisher	AAAI Press
dc.relation.ispartof	Proceedings of the 2025 AAAI Spring Symposium Series
dc.spatial	San Francisco
dc.subject.ddc	330 - Wirtschaft
dc.subject.ddc	005 - Computer Programmierung, Programme und Daten
dc.title	Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making
dc.type	04B - Beitrag Konferenzschrift
dspace.entity.type	Publication
fhnw.InventedHere	Yes
fhnw.ReviewType	Anonymous ex ante peer review of a complete publication
fhnw.affiliation.hochschule	Hochschule für Wirtschaft FHNW	de_CH
fhnw.affiliation.institut	Institut für Wirtschaftsinformatik	de_CH
fhnw.openAccessCategory	Closed
fhnw.pagination	213-219
fhnw.publicationState	Published
relation.isAuthorOfPublication	d8c9d823-cabc-40f0-85d9-e5321a887f22
relation.isAuthorOfPublication	41261aa9-9368-4700-8adf-f09487bf7e7e
relation.isAuthorOfPublication	87460844-8df7-4204-80ad-a33ed72cc96c
relation.isAuthorOfPublication	47ab0867-6bcc-4476-9891-def80a6fcc9b
relation.isAuthorOfPublication.latestForDiscovery	d8c9d823-cabc-40f0-85d9-e5321a887f22

Dateien

Lizenzbündel

Gerade angezeigt 1 - 1 von 1

Name:: license.txt
Größe:: 2.66 KB
Format:: Item-specific license agreed upon to submission
Beschreibung:

Herunterladen

Sammlung

Institut für Wirtschaftsinformatik