Section 01
Introduction: Applying a LLaVA-based Multimodal Model to Cardiac MRI Analysis
This article introduces a multimodal large language model system built on the LLaVA architecture. The system performs cross-modal semantic alignment between cardiac MRI images and clinical text to support early screening for cardiovascular disease, offering a new technical approach for medical AI applications. The project also illustrates the potential of vision-language models in medical image analysis.