Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making

dc.contributor.authorHatemo, Sahan
dc.contributor.authorWeickhardt, Christof
dc.contributor.authorGisler, Luca
dc.contributor.authorBendel, Oliver
dc.contributor.editorPetrick, Ron
dc.contributor.editorGeib, Christopher
dc.date.accessioned2025-07-28T13:04:55Z
dc.date.issued2025
dc.description.abstractThe trolley problem has long served as a lens for exploring moral decision-making, now gaining renewed significance in the context of artificial intelligence (AI). This study investigates ethical reasoning in three open-source large language models (LLMs)—LLaMA, Mistral and Qwen—through variants of the trolley problem. By introducing demographic prompts (age, nationality and gender) into three scenarios (switch, loop and footbridge), we systematically evaluate LLM responses against human survey data from the Moral Machine experiment. Our findings reveal notable differences: Mistral exhibits a consistent tendency to overintervene, while Qwen chooses to intervene less and LLaMA balances between the two. Notably demographic attributes, particularly nationality, significantly influence LLM decisions, exposing potential biases in AI ethical reasoning. These insights underscore the necessity of refining LLMs to ensure fairness and ethical alignment, leading the way for more trustworthy AI systems.
dc.event2025 AAAI Spring Symposium
dc.event.end2025-04-02
dc.event.start2025-03-31
dc.identifier.doi10.1609/aaaiss.v5i1.35590
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/52099
dc.language.isoen
dc.publisherAAAI Press
dc.relation.ispartofProceedings of the 2025 AAAI Spring Symposium Series
dc.spatialSan Francisco
dc.subject.ddc330 - Wirtschaft
dc.subject.ddc005 - Computer Programmierung, Programme und Daten
dc.titleRevisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making
dc.type04B - Beitrag Konferenzschrift
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.ReviewTypeAnonymous ex ante peer review of a complete publication
fhnw.affiliation.hochschuleHochschule für Wirtschaft FHNWde_CH
fhnw.affiliation.institutInstitut für Wirtschaftsinformatikde_CH
fhnw.openAccessCategoryClosed
fhnw.pagination213-219
fhnw.publicationStatePublished
relation.isAuthorOfPublicationd8c9d823-cabc-40f0-85d9-e5321a887f22
relation.isAuthorOfPublication41261aa9-9368-4700-8adf-f09487bf7e7e
relation.isAuthorOfPublication87460844-8df7-4204-80ad-a33ed72cc96c
relation.isAuthorOfPublication47ab0867-6bcc-4476-9891-def80a6fcc9b
relation.isAuthorOfPublication.latestForDiscoveryd8c9d823-cabc-40f0-85d9-e5321a887f22
Dateien

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
2.66 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: