Section 01
Dual-Perspective Analysis of Self-Referential Representations in Large Language Models: Core Viewpoints Guide
This article proposes an innovative interpretability research method that combines biological topology and activation space geometry to characterize self-referential representations in large language models from dual dimensions, providing a new perspective for understanding the internal mechanisms of models. Keywords: Interpretability, Large Language Models, Self-Reference, Biological Topology, Activation Space Geometry, etc.