Section 01
NucBench: Introduction to the First Multimodal LLM Evaluation Benchmark for Nuclear Engineering
NucBench is the first open-source multimodal large language model evaluation benchmark for the nuclear engineering field, developed by the team from the University of Sharjah. It includes approximately 4292 multiple-choice questions from the Reactor Operator License Exam (GFE), over 100 mixed-type questions from undergraduate nuclear engineering exams, and a two-phase flow regime image recognition dataset, aiming to provide a standardized test for evaluating LLMs' knowledge mastery and reasoning abilities in the nuclear engineering field.