Section 01
[Introduction] NucBench: The First Multimodal Large Model Evaluation Benchmark for Nuclear Engineering
NucBench is the first open-source multimodal large language model evaluation benchmark designed specifically for nuclear engineering application scenarios, filling the gap in AI application evaluation in the nuclear energy field. Developed by the NS3G-UoS team, it aims to establish a comprehensive and authoritative evaluation framework to test the performance of models on nuclear engineering-related tasks, covering dimensions such as basic nuclear physics, technical document parsing, multimodal fusion, and safety decision-making, thereby promoting the safe and effective implementation of AI in the nuclear energy field.