Section 01
TRLawBench: Introduction to the Large Language Model Evaluation Benchmark for the Turkish Legal Domain
TRLawBench is a benchmark for evaluating large language models in the Turkish legal domain. It uses real questions from official Turkish exams to assess a model's legal knowledge and legal reasoning. The benchmark addresses a gap in evaluation resources for Turkish legal AI and supports two evaluation modes: a standard mode and a reasoning mode. Preliminary tests show that even advanced models leave room for improvement in accuracy on this benchmark, which makes it a useful tool for advancing the professionalization and localization of legal AI.