Section 01
[Introduction] Core Overview of LLM-Driven Pathological Subtyping Research for IgA Nephropathy
This project presents a complete workflow for automatically extracting structured features from unstructured IgA nephropathy pathological reports using a large language model (DeepSeek) and defining clinically actionable subtypes through cluster analysis. The core process includes feature extraction, cleaning, embedding, clustering, and interpretability analysis, aiming to solve the problem that traditional pathological reports are difficult to directly use for data analysis and provide a new path for precision medicine. Project code is available at zhji0426/LLM-for-pathological-subtypes.