Zing Forum

Reading

JointCoder: A Multi-Agent Framework-Based Automatic ICD Coding System for Chinese Medical Records

An open-source project for automatic ICD coding of real-world Chinese electronic medical records, using a multi-agent framework to jointly predict ICD-10 disease codes and ICD-9-CM-3 surgical procedure codes, including real hospital datasets and a complete demonstration system.

医疗AIICD编码多智能体电子病历自然语言处理中文NLP机器学习临床决策支持医疗信息化
Published 2026-05-08 09:45Recent activity 2026-05-08 10:35Estimated read 5 min
JointCoder: A Multi-Agent Framework-Based Automatic ICD Coding System for Chinese Medical Records
1

Section 01

JointCoder Project Introduction: A Multi-Agent Framework-Driven Automatic ICD Coding System for Chinese Medical Records

JointCoder is an open-source project for automatic ICD coding of real-world Chinese electronic medical records. Its core uses a multi-agent framework to achieve joint prediction of ICD-10 disease codes and ICD-9-CM-3 surgical procedure codes. It also provides real hospital datasets and a complete web demonstration system, aiming to solve the industry pain points of low efficiency and high error rate in manual coding.

2

Section 02

Project Background: Automation Challenges of Medical ICD Coding

ICD coding is a core part of medical informatization. It requires professionals to convert clinical diagnoses and surgical procedures into standard codes, but manual coding is time-consuming, labor-intensive, and easily affected by subjective factors. JointCoder provides a solution to this pain point, supporting joint prediction of disease and surgical codes for Chinese electronic medical records.

3

Section 03

Core Innovation: Multi-Agent Collaborative Coding Framework

JointCoder adopts a multi-stage multi-agent architecture aligned with workflow, decomposing the coding task into multiple collaborative stages, each handled by a dedicated agent (different from traditional end-to-end single models). This design improves interpretability (intermediate results can be reviewed) and controllability, and can flexibly introduce domain knowledge. It also supports joint prediction of disease and surgical codes, capturing their correlations to improve overall accuracy.

4

Section 04

Dataset and Performance

The project provides a desensitized dataset from real tertiary hospitals (6747 inpatient records, including ICD-10/ICD-9-CM-3 codes), which has authenticity, completeness, and Chinese characteristics. Performance evaluation shows: JointCoder outperforms traditional machine learning, deep learning, and commercial APIs in disease and surgical coding tasks; joint coding achieves better results; ablation experiments verify the positive contributions of components like multi-agent collaboration and workflow alignment.

5

Section 05

System Functions and Demonstration

JointCoder provides a complete web demonstration system with four functional modules: 1. Demo case selection (4 preloaded samples, supporting OCR/desensitization/editing/parallel coding); 2. Single PDF upload (local PDF processing); 3. Batch PDF upload; 4. Excel batch submission (no OCR needed, higher efficiency). After coding, it supports feedback, marking as completed, and result export.

6

Section 06

Application Prospects and Limitations

Application Prospects: Hospitals can improve coding efficiency and accuracy; medical insurance departments can support reasonable payment and disease monitoring; medical research can help with epidemiological analysis and clinical decision support system development. Limitations: Data privacy (upload is disabled in public demonstrations), domain adaptability (needs fine-tuning to adapt to other hospitals/departments), coding rules need regular updates and maintenance.

7

Section 07

Open-Source Contributions and Community Value

JointCoder is released as open-source on GitHub, providing code, pre-trained models, detailed documentation, and examples. Open-source lowers the threshold for Chinese medical AI research, provides the community with real datasets, multi-agent frameworks, and demonstration systems, and promotes the development of the field.