Zing Forum

Reading

Application of Small Language Models in Structured Extraction of Pediatric Pathology Reports: An Efficient and Privacy-Preserving Medical Information Processing Solution

This article introduces a semi-automatic annotation workflow based on small language models, which can extract structured information from unstructured pediatric histopathology reports in a CPU-only environment, achieving an accuracy rate of 84.3% while protecting patient privacy.

小型语言模型医疗信息提取电子病历儿科病理学隐私保护自然语言处理
Published 2026-04-06 00:15Recent activity 2026-04-07 15:32Estimated read 5 min
Application of Small Language Models in Structured Extraction of Pediatric Pathology Reports: An Efficient and Privacy-Preserving Medical Information Processing Solution
1

Section 01

[Introduction] Efficient Privacy-Preserving Solution for Small Language Models in Pediatric Pathology Report Extraction

This article introduces a semi-automatic annotation workflow based on small language models (SLMs), which can extract structured information from unstructured pediatric pathology reports in a CPU-only environment. It balances resource efficiency and privacy protection, with the Gemma 2 2B model achieving an accuracy rate of 84.3%, providing a feasible path for the democratization of medical AI.

2

Section 02

Dilemmas of Medical Data and Limitations of Existing Solutions

A large amount of unstructured text (such as pathology reports) in electronic medical records is difficult to reuse, hindering clinical research and decision-making. Existing solutions have limitations: large language models (LLMs) require GPU resources and have high privacy risks; traditional NLP tools lack sufficient accuracy and struggle to handle the specificity and context dependence of medical texts.

3

Section 03

Research Objectives and Methodology

The research objective is to develop a resource-efficient and privacy-preserving semi-automatic annotation workflow. Pediatric renal biopsy reports were selected as the validation domain, and an iterative development approach (requirement definition → guideline formulation → model development → validation and evaluation) was adopted. A gold-standard dataset (400 expert-annotated reports) was constructed, and the extraction task was framed as a question-answering task, with the model guided by clinical guidelines and few-shot examples.

4

Section 04

Model Evaluation Results and Key Findings

Five SLMs and multiple baseline models were evaluated. The Gemma 2 2B model performed the best (84.3% accuracy), significantly outperforming baselines such as spaCy (74.3%) and BioBERT-SQuAD (62.3%). Key findings: clinical guidelines improved performance by 7-19%, few-shot examples improved performance by 6-38%, but there was no additive effect when combining the two.

5

Section 05

Disagreement Modeling and Practical Application Value

A disagreement modeling framework was introduced to prioritize the review of samples with high model uncertainty, improving workflow efficiency. The value of this solution for medical institutions includes cost-effectiveness (CPU operation), privacy compliance (local processing), and scalability; its contribution to the research community is proving the potential of SLMs in professional medical NLP tasks and challenging the "bigger is better" assumption.

6

Section 06

Limitations and Future Directions

Research limitations: validated only in the field of pediatric renal biopsy, accuracy still has room for improvement, and the combination strategy of guidelines and few-shot examples needs optimization. Future directions: expand to other medical specialties, explore multi-SLM integration methods, and develop more refined uncertainty quantification techniques.