Section 01
Introduction: Core of the Study on Multi-Dimensional Representation Mechanism of Rhetorical Questions in LLMs
This study uses linear probing technology to explore the internal representation mechanism of rhetorical questions in LLMs. Key findings include: Rhetorical signals emerge in the early layers of the model, and the representation of the last token is the most stable; rhetorical questions are encoded along multiple linear directions in the representation space, and probes trained on different datasets capture different rhetorical phenomena; cross-dataset transfer is detectable but has differences, revealing LLMs' multi-dimensional understanding of rhetorical questions.