Section 01
Introduction: Multimodal Trolley Problem Research—Exploring Moral Biases and Alignment Issues in LLMs
This study is based on the classic Moral Machine experimental framework and tests whether three mainstream large language models (LLMs)—Claude, GPT-4.1, and Gemini—exhibit demographic biases when making moral decisions in multimodal scenarios. Using a rigorous design that includes dual experimental arms (text and image) and mirrored pairing controls, the study explores core issues of AI value alignment through open-source methods, providing references for the ethical safety of LLM applications in high-risk domains.