Section 01
Introduction: Unilaw-R1 — A Reinforcement Learning Large Language Model Focused on Legal Reasoning
Unilaw-R1 is the official implementation of a paper accepted by EMNLP 2025, a large language model focused on legal domain reasoning. This project combines reinforcement learning and iterative reasoning techniques, is trained on the JEC-QA dataset, and has open-sourced model weights for academic research use.