Section 01
Introduction: Cambridge MPhil Thesis Open-Sourced — Reproducing Anthropic's Mechanistic Interpretability Research on Qwen3-4B
Iuliia Vitiugova from the DAMPT department at the University of Cambridge recently open-sourced her master's thesis project, successfully reproducing the core methods of Anthropic's research 'On the Biology of Large Language Models' (transcoder feature extraction, attribution graph construction, causal intervention validation) on the open-source large language model Qwen3-4B. This fills a key gap in the mechanistic interpretability field of the open-source community and provides a complete reproducible technical framework for multilingual circuit analysis.