Section 01
[Introduction] DermEVAL: A Dermatologist-Reviewed Multimodal Large Language Model Evaluation Benchmark
DermEVAL is a dermatologist-reviewed multimodal large language model (MLLM) evaluation benchmark focused on the field of dermatology. Addressing challenges such as high professional barriers, significant safety risks, and strong domain specificity in medical AI evaluation, it provides a professional and reliable testing platform to assess MLLMs' capabilities in medical image understanding and clinical reasoning, and promotes the responsible development of medical AI.