Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making

Hatemo, Sahan; Weickhardt, Christof; Gisler, Luca; Bendel, Oliver

Publication date

2025

Collection

Institut für Wirtschaftsinformatik

Type

04B - Conference proceedings contribution

Editors

Petrick, Ron

Geib, Christopher

Parent work

Proceedings of the 2025 AAAI Spring Symposium Series

DOI of the original publication

https://doi.org/10.1609/aaaiss.v5i1.35590

URI

https://irf.fhnw.ch/handle/11654/52099

Pages / Duration

213-219

Publisher / Publishing institution

AAAI Press

Place of publication / Event location

San Francisco

Abstract

The trolley problem has long served as a lens for exploring moral decision-making and is now gaining renewed significance in the context of artificial intelligence (AI). This study investigates ethical reasoning in three open-source large language models (LLMs), namely LLaMA, Mistral and Qwen, through variants of the trolley problem. By introducing demographic prompts (age, nationality and gender) into three scenarios (switch, loop and footbridge), we systematically evaluate LLM responses against human survey data from the Moral Machine experiment. Our findings reveal notable differences: Mistral exhibits a consistent tendency to over-intervene, while Qwen chooses to intervene less and LLaMA balances between the two. Notably, demographic attributes, particularly nationality, significantly influence LLM decisions, exposing potential biases in AI ethical reasoning. These insights underscore the necessity of refining LLMs to ensure fairness and ethical alignment, paving the way for more trustworthy AI systems.

Subject area (DDC)

330 - Economics
005 - Computer programming, programs and data

Event

2025 AAAI Spring Symposium

Conference start date

31.03.2025

Conference end date

02.04.2025

Language

English

Created during FHNW affiliation

Yes

Publication status

Published

Review

peer-reviewed

Open access status

Closed

Citation

Hatemo, S., Weickhardt, C., Gisler, L., & Bendel, O. (2025). Revisiting the trolley problem for AI: biases and stereotypes in Large Language Models and their impact on ethical decision-making. In R. Petrick & C. Geib (Eds.), Proceedings of the 2025 AAAI Spring Symposium Series (pp. 213–219). AAAI Press. https://doi.org/10.1609/aaaiss.v5i1.35590

Komplettanzeige