Section 01
Introduction: Data Leakage Pitfalls in Medical AI — A Comparative Study of Two Breast Cancer Recurrence Prediction Models
This article focuses on the GitHub open-source project breast-cancer-recurrence-ann, which reveals data leakage issues in medical AI by comparing two neural network models and demonstrates how to build a clinically practical prediction system. The core value of the project is to remind the medical AI community: seemingly excellent model metrics may hide fatal flaws, and systems need to be designed in combination with clinical reality.