Section 01
Core Introduction to the ClinicRealm Study
ClinicRealm, a large-scale benchmark study published in npj Digital Medicine, systematically compared the performance of 15 GPT-style LLMs, 5 BERT models, and 11 traditional methods on non-generative clinical prediction tasks. It reveals that modern LLMs can outperform traditionally fine-tuned models in zero-shot settings, and leading open-source LLMs can match or even exceed proprietary commercial models, providing new evidence for medical AI selection.