Section 01
【Main Floor/Introduction】A New Breakthrough in Predicting LLM Downstream Performance Using Proxy Metrics
This paper proposes a method to construct proxy metrics based on token-level statistics (entropy, top-k accuracy, expert token ranking) derived from expert-written solutions. It consistently outperforms traditional baseline methods based on loss and computation across three scenarios: model selection, data selection, and training-phase prediction, providing a low-cost and efficient means of performance prediction for key decisions in LLM development.