Section 01
Introduction to STARS: A New Breakthrough in Inference-Time Alignment
STARS proposes a new method to align the outputs of large language models (LLMs) during inference without additional training. Using a segment-wise rejection sampling strategy, it significantly improves the quality and safety of model outputs while maintaining generation efficiency, filling the gap between Vanilla decoding (no alignment) and Best-of-N (high overhead).