Section 01
Introduction: MGDA-Decoupled Enables Fair Multi-Objective Alignment for LLMs
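MGDA-Decoupled is a geometry-aware multi-objective optimization algorithm designed specifically for the Direct Preference Optimization (DPO) framework. It builds on the Multiple Gradient Descent Algorithm (MGDA) to balance several objectives in LLM alignment (e.g., helpfulness, truthfulness, harmlessness) and to avoid the procedural unfairness that arises when objectives are combined with fixed weights, where the chosen weights decide in advance which objective dominates training. The method is reported to achieve the highest win rate among the compared approaches on the UltraFeedback dataset, offering a new path toward fair and balanced AI systems.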
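To make the contrast with fixed weights concrete, the sketch below shows the classic two-objective MGDA step, which picks the minimum-norm convex combination of two gradients. It is not the MGDA-Decoupled algorithm itself, whose details are not given in this section. The function names and the toy gradient values are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def combine(g1: Sequence[float], g2: Sequence[float], w1: float) -> List[float]:
    """Return w1 * g1 + (1 - w1) * g2."""
    return [w1 * x + (1.0 - w1) * y for x, y in zip(g1, g2)]


def mgda_two_objective_weight(g1: Sequence[float], g2: Sequence[float]) -> float:
    """Weight on g1 that minimizes ||w * g1 + (1 - w) * g2||^2 over w in [0, 1].

    This is the closed-form solution of classic MGDA for two objectives
    (an illustrative baseline, not MGDA-Decoupled).
    """
    diff = [x - y for x, y in zip(g1, g2)]
    denom = dot(diff, diff)
    if denom == 0.0:
        # Identical gradients: any weight gives the same direction.
        return 0.5
    w = dot([y - x for x, y in zip(g1, g2)], g2) / denom
    return min(1.0, max(0.0, w))


# Toy gradients (hypothetical): objective 2 has a much larger gradient norm.
g1 = [1.0, 0.0]
g2 = [0.0, 4.0]

fixed = combine(g1, g2, 0.5)          # fixed equal weights
alpha = mgda_two_objective_weight(g1, g2)
balanced = combine(g1, g2, alpha)     # min-norm (MGDA) direction

# Alignment of each update direction with each objective's gradient.
print("fixed:   ", dot(fixed, g1), dot(fixed, g2))
print("min-norm:", dot(balanced, g1), dot(balanced, g2))
```

With fixed equal weights, the update direction is dominated by the objective with the larger gradient. The min-norm direction instead has the same inner product with both gradients whenever the optimal weight lies strictly between 0 and 1, so neither objective is favored by gradient scale alone.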