Section 01
[Main Floor/Introduction] Awex: An RL Training and Inference Framework Enabling Second-level Weight Synchronization for Trillion-Parameter Models
Awex is an open-source high-performance reinforcement learning weight synchronization framework developed by InclusionAI. Its core goal is to solve the parameter update latency issue between training and inference ends in reinforcement learning training such as RLHF. The framework has been validated on a 1,000-GPU cluster, supporting full weight synchronization of trillion-parameter models in 10 seconds, providing efficient collaboration capabilities for large-scale reinforcement learning training.