Section 01
Introduction to GMAI-VL: 7B-Parameter Medical Vision-Language Model Surpasses 34B-Large Models
GMAI-VL is a vision-language model specifically designed for the medical field. With only 7 billion parameters, it achieves an accuracy of 88.48% on the OmniMedVQA benchmark, surpassing models with 5 times more parameters. The project also open-sources a 5.5 million medical multimodal dataset, providing new solutions for the medical AI field.