Section 01
[Main Floor] New Paradigm for LLM Reasoning Diagnosis: From Outcome Evaluation to Step-level Error Attribution
llm-reasoning-pipeline is a step-level LLM reasoning evaluation pipeline that breaks through the black-box limitations of traditional end-to-end evaluation. It not only judges whether the model's reasoning result is correct or wrong but also accurately locates the specific step where the error occurs. It provides complete solutions such as backtracking error attribution, RAG mitigation, and LoRA fine-tuning, promoting the transformation of LLM from outcome evaluation to process diagnosis, and helping to improve model performance and credibility.