Section 01
DOT Technology Guide: Dynamic Outlier Truncation Empowers Efficient Reasoning Model Training
DOT (Dynamic Outlier Truncation Technology) is the official code implementation of a paper accepted by ACL 2026. It proposes a dynamic outlier truncation method to address the "length bias" problem in reasoning model training. While maintaining model performance, it significantly improves training efficiency and stability, providing a new solution for reasoning model training optimization.