Section 01
[Introduction] ShoggothBench: A Benchmark Framework for Quantifying LLM Personality Deviation and Uninterpretability
Title: ShoggothBench: Quantifying Personality Deviation and Behavioral Uninterpretability of Large Language Models
Abstract: ShoggothBench is a benchmark framework for measuring how the behavior of large language models deviates under role pressure. It compares four categories of behavior: the declared personality, other personality patterns, general strategy behaviors, and residual uninterpreted behaviors. This comparison helps identify elusive "Shoggoth candidate" behavioral patterns.
Source Information: Original author/maintainer: nikakogho; Source platform: GitHub; Original link: https://github.com/nikakogho/ShoggothBench; Release date: 2026-05-31
Core Value: Gives AI safety researchers a quantitative tool for examining how consistent a model's internal mechanisms are with its surface personality settings.
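To make the four categories named in the abstract concrete, here is a minimal Python sketch of how a residual share could be tallied. It is not taken from the ShoggothBench repository: the bucket names, the priority order, and the `attribute` and `residual_report` helpers are all hypothetical. It also assumes some upstream judge has already decided which explanations could account for each observed behavior.

```python
from collections import Counter

# Hypothetical bucket names mirroring the four categories in the abstract;
# they are illustrative and not taken from the ShoggothBench codebase.
BUCKETS = ("declared_persona", "other_persona", "general_strategy")
RESIDUAL = "residual_uninterpreted"


def attribute(explanations):
    """Return the first bucket that explains a behavior, else the residual bucket.

    `explanations` is the set of bucket names that an (assumed) upstream
    judge decided could account for the behavior.
    """
    for bucket in BUCKETS:  # priority order is an assumption of this sketch
        if bucket in explanations:
            return bucket
    return RESIDUAL


def residual_report(observations):
    """Compute the share of behaviors per bucket for a list of explanation sets."""
    names = (*BUCKETS, RESIDUAL)
    total = len(observations)
    if total == 0:
        return {name: 0.0 for name in names}
    counts = Counter(attribute(expl) for expl in observations)
    return {name: counts[name] / total for name in names}


if __name__ == "__main__":
    sample = [
        {"declared_persona"},
        {"declared_persona", "general_strategy"},
        {"other_persona"},
        set(),  # nothing explains this behavior, so it lands in the residual
    ]
    print(residual_report(sample))
```

Under these assumptions, a high `residual_uninterpreted` share would flag a model whose behavior is poorly explained by any of the named patterns. How the real framework scores and weights behaviors may differ.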