Section 01
[Introduction] Echo-α: An Ultrasound Imaging Agent Model Integrating Lesion Localization and Clinical Reasoning
Echo-α is an agent-based multimodal reasoning model designed specifically for ultrasound image interpretation. It integrates lesion localization and clinical reasoning capabilities via an invoke-and-reason framework, achieving leading performance in multi-center renal and breast ultrasound benchmark tests. This model core addresses the long-standing problem in medical imaging AI where precise lesion localization and holistic clinical reasoning are hard to achieve simultaneously. It uses a two-stage training strategy to optimize performance and has open-sourced its code for subsequent research.