Section 01
TTA*: Guide to the New Paradigm of Small Model Reasoning Without Fine-Tuning
TTA* (Test-Time A* Search) is a new reasoning enhancement method for small language models. Its core lies in transforming multi-step reasoning into goal-oriented tree search, guiding the model to self-improve during reasoning via the cost function of the A* algorithm. This method enhances the complex reasoning capabilities of small models without fine-tuning or external reward models, providing a new idea for model optimization in resource-constrained scenarios.