# Application of Small Language Models in Structured Extraction of Pediatric Pathology Reports: An Efficient and Privacy-Preserving Medical Information Processing Solution

> This article introduces a semi-automatic annotation workflow based on small language models, which can extract structured information from unstructured pediatric histopathology reports in a CPU-only environment, achieving an accuracy rate of 84.3% while protecting patient privacy.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-04-05T16:15:39.000Z
- 最近活动: 2026-04-07T07:32:16.599Z
- 热度: 98.7
- 关键词: 小型语言模型, 医疗信息提取, 电子病历, 儿科病理学, 隐私保护, 自然语言处理
- 页面链接: https://www.zingnex.cn/en/forum/thread/llm-arxiv-2604-04168v1
- Canonical: https://www.zingnex.cn/forum/thread/llm-arxiv-2604-04168v1
- Markdown 来源: floors_fallback

---

## [Introduction] Efficient Privacy-Preserving Solution for Small Language Models in Pediatric Pathology Report Extraction

This article introduces a semi-automatic annotation workflow based on small language models (SLMs), which can extract structured information from unstructured pediatric pathology reports in a CPU-only environment. It balances resource efficiency and privacy protection, with the Gemma 2 2B model achieving an accuracy rate of 84.3%, providing a feasible path for the democratization of medical AI.

## Dilemmas of Medical Data and Limitations of Existing Solutions

A large amount of unstructured text (such as pathology reports) in electronic medical records is difficult to reuse, hindering clinical research and decision-making. Existing solutions have limitations: large language models (LLMs) require GPU resources and have high privacy risks; traditional NLP tools lack sufficient accuracy and struggle to handle the specificity and context dependence of medical texts.

## Research Objectives and Methodology

The research objective is to develop a resource-efficient and privacy-preserving semi-automatic annotation workflow. Pediatric renal biopsy reports were selected as the validation domain, and an iterative development approach (requirement definition → guideline formulation → model development → validation and evaluation) was adopted. A gold-standard dataset (400 expert-annotated reports) was constructed, and the extraction task was framed as a question-answering task, with the model guided by clinical guidelines and few-shot examples.

## Model Evaluation Results and Key Findings

Five SLMs and multiple baseline models were evaluated. The Gemma 2 2B model performed the best (84.3% accuracy), significantly outperforming baselines such as spaCy (74.3%) and BioBERT-SQuAD (62.3%). Key findings: clinical guidelines improved performance by 7-19%, few-shot examples improved performance by 6-38%, but there was no additive effect when combining the two.

## Disagreement Modeling and Practical Application Value

A disagreement modeling framework was introduced to prioritize the review of samples with high model uncertainty, improving workflow efficiency. The value of this solution for medical institutions includes cost-effectiveness (CPU operation), privacy compliance (local processing), and scalability; its contribution to the research community is proving the potential of SLMs in professional medical NLP tasks and challenging the "bigger is better" assumption.

## Limitations and Future Directions

Research limitations: validated only in the field of pediatric renal biopsy, accuracy still has room for improvement, and the combination strategy of guidelines and few-shot examples needs optimization. Future directions: expand to other medical specialties, explore multi-SLM integration methods, and develop more refined uncertainty quantification techniques.
