Section 01
[Introduction] Fundus-R1: The First Knowledge-Aware Multimodal Large Model for Fundus Images Trained on Public Data
This article introduces the Fundus-R1 model, the first multimodal large model for fundus image analysis trained exclusively on public datasets. Using RAG to generate knowledge-aware reasoning chains and RLVR enhanced by process rewards, it outperforms general-purpose models on multiple benchmarks. This model addresses the barrier of existing fundus MLLMs relying on internal data, providing a new path for the democratization of medical AI.