Section 01
[Introduction] Identifiable Victim Effect in Large Language Models: How Narrative Trumps Numbers
This article examines the 'Identifiable Victim Effect' (IVE) in large language models (LLMs) when they make decisions involving human lives: the tendency to prioritize saving a specific, identifiable individual over a larger but statistically described group. The study found that models aligned with RLHF exhibit a more pronounced bias, and that stronger reasoning capabilities may serve to 'rationalize' emotion-driven choices rather than correct them. The article also analyzes the underlying mechanisms, the practical implications for deployed systems, and possible mitigation strategies, underscoring the need to balance AI 'humanization' against rational fairness.