Section 01
[Introduction] Small Language Models Empower Large Model Prompt Disambiguation: A New Low-Cost, High-Efficiency Approach to Inference Optimization
The research team proposes a prompt optimization method that uses a small language model (SLM) to semantically disambiguate ambiguous prompts before they reach the large model for inference. The reported result is a 2.5 percentage point improvement in large model inference performance at a cost of only $0.02. Because the optimization happens entirely in the preprocessing stage, it does not interfere with the large model's internal mechanisms, which makes it a low-cost, efficient option for large model applications.
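The preprocessing idea described above can be illustrated with a minimal sketch. This is not the research team's implementation: the function name `disambiguate_then_infer`, the `TextModel` alias, and the wording of `DISAMBIGUATION_TEMPLATE` are all hypothetical, and both models are assumed to be plain functions that take a prompt string and return a string.

```python
from typing import Callable

# Hypothetical alias: any function mapping a prompt string to a completion string.
TextModel = Callable[[str], str]

# Assumed instruction text for the small model; the source does not give the real one.
DISAMBIGUATION_TEMPLATE = (
    "Rewrite the following prompt so that it has a single, unambiguous meaning. "
    "Return only the rewritten prompt.\n\nPrompt: {prompt}"
)


def disambiguate_then_infer(prompt: str, slm: TextModel, llm: TextModel) -> str:
    """Clarify the prompt with a small model, then run the large model on the result."""
    clarified = slm(DISAMBIGUATION_TEMPLATE.format(prompt=prompt)).strip()
    # Fall back to the original prompt if the small model returns nothing usable.
    if not clarified:
        clarified = prompt
    # The large model is called as a black box; nothing inside it is modified.
    return llm(clarified)
```

In this sketch the large model only ever sees the rewritten prompt, so the same wrapper could in principle sit in front of any model that is reachable through a text-in, text-out interface.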