Section 01
[Introduction] ReProbe: Efficient Test-Time Reasoning Expansion via Probing Internal States of LLMs
ReProbe is the official implementation of a paper accepted by ACL 2026, proposing a new method to efficiently expand test-time reasoning by probing the internal states of Large Language Models (LLMs). Its core idea is to use the internal hidden states of LLMs to guide the reasoning process, intelligently allocate computing resources, and reduce computational overhead while maintaining reasoning quality. Compared to traditional test-time expansion methods, ReProbe has significant advantages in computational efficiency, expansion accuracy, and generality, and performs excellently on multiple benchmark tasks such as mathematical reasoning, logical reasoning, and code generation.