MAPLE: Automating QSP Model Parameter Extraction from Literature Using Large Language Models

A structured pipeline tool that uses LLMs to extract quantitative pharmacology parameters from scientific literature, generates informative prior distributions via Bayesian inference, and addresses data challenges in QSP model calibration.

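Quantitative systems pharmacology, QSP models, literature mining, Bayesian inference, NumPyro, parameter calibration, LLM applications, drug development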

Published 2026-04-09 06:14 | Recent activity 2026-04-09 06:19 | Estimated read 6 min

Section 01

Introduction: MAPLE—An LLM-Driven Tool for Automated QSP Model Parameter Extraction

MAPLE is a structured pipeline tool that uses Large Language Models (LLMs) to extract quantitative pharmacology parameters from scientific literature. It generates informative prior distributions via Bayesian inference, addressing challenges like scattered data and heterogeneous sources in Quantitative Systems Pharmacology (QSP) model calibration, and supports parameter extraction and model building in drug development.

Section 02

Project Background and Core Issues

QSP models contain many biological parameters that cannot be measured directly in the clinic. The relevant data are scattered across hundreds of publications that cover different species and indications, report values in diverse formats, and are difficult to convert to a common form. Manual curation is error-prone; MAPLE automates and standardizes the process through LLM-assisted extraction and statistical inference.

Section 03

Core Methods and Architecture Design

MAPLE uses a two-stage calibration pipeline:

Literature extraction and validation (LLM + Pydantic to generate YAML files), plus joint MCMC inference to generate sub-model priors
Simulation-based inference (SBI) combining clinical data and QSP simulators

The innovation lies in quantifying data source quality (evaluated along 8 dimensions) and adjusting data source weights through a translation sigma. For example, mouse data receive a lower weight than human clinical data.
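
One way to picture how a translation sigma can downweight less relevant sources is inverse-variance pooling in log space. The sketch below is illustrative only and is not taken from MAPLE: the function name `pool_log_measurements`, the numeric values, and the choice to add the translation variance to the measurement variance are all assumptions.

```python
import math

def pool_log_measurements(measurements):
    """Pool (value, measurement_sigma, translation_sigma) triples in log space.

    Assumption: each source's total variance is
    measurement_sigma**2 + translation_sigma**2, so a larger translation
    sigma yields a smaller inverse-variance weight.
    Returns (pooled_log_mean, pooled_log_sigma, weights).
    """
    precisions = []
    log_values = []
    for value, meas_sigma, trans_sigma in measurements:
        total_var = meas_sigma ** 2 + trans_sigma ** 2
        precisions.append(1.0 / total_var)
        log_values.append(math.log(value))
    total_precision = sum(precisions)
    weights = [p / total_precision for p in precisions]
    pooled_mean = sum(w * x for w, x in zip(weights, log_values))
    pooled_sigma = math.sqrt(1.0 / total_precision)
    return pooled_mean, pooled_sigma, weights

# Hypothetical inputs: (value, measurement sigma, translation sigma)
human = (0.10, 0.3, 0.1)  # human clinical source, small translation sigma
mouse = (0.30, 0.3, 1.0)  # mouse source, large translation sigma
mean, sigma, weights = pool_log_measurements([human, mouse])
print(f"pooled median = {math.exp(mean):.3f}, log-sigma = {sigma:.3f}")
print(f"weights: human = {weights[0]:.2f}, mouse = {weights[1]:.2f}")
```

With these made-up numbers the human source dominates the pooled estimate, which matches the weighting behavior described above.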

Section 04

Technical Implementation Details

YAML Structure: Structured association between literature measurements and model parameters, including target ID, input, calibration rules, source relevance, etc.
Forward Model Types: Supports multiple types like algebraic formulas, dose-response fitting, ODE systems, etc.
Nuisance Parameter Handling: Mark additional parameters as nuisance; estimate them in MCMC but exclude from final output.
Batch Pipeline: Stages include literature search, PDF collection, evaluation, extraction, and validation; a caching mechanism is supported.
Input Format: Target parameter CSV must include ID, parameter, cancer type, and search annotations (e.g., specific keywords).
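
The input requirements above can be checked before a run starts. The following is a minimal sketch using only the standard library; the column names (`target_id`, `parameter`, `cancer_type`, `notes`), the sample row, and the function `load_targets` are hypothetical and may not match MAPLE's actual schema.

```python
import csv
import io

# Hypothetical column names; the real CSV headers may differ.
REQUIRED_COLUMNS = ("target_id", "parameter", "cancer_type", "notes")

def load_targets(csv_text):
    """Parse a target-parameter CSV and reject incomplete or duplicate rows."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    missing = [c for c in REQUIRED_COLUMNS if c not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    targets = []
    seen_ids = set()
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(reader, start=2):
        clean = {c: (row[c] or "").strip() for c in REQUIRED_COLUMNS}
        empty = [c for c, v in clean.items() if not v]
        if empty:
            raise ValueError(f"line {line_no}: empty fields {empty}")
        if clean["target_id"] in seen_ids:
            raise ValueError(f"line {line_no}: duplicate id {clean['target_id']}")
        seen_ids.add(clean["target_id"])
        targets.append(clean)
    return targets

# Made-up example row for illustration.
SAMPLE_CSV = """target_id,parameter,cancer_type,notes
T001,k_growth,PDAC,"MVD growth kinetics; rate = k_growth * MVD"
"""

targets = load_targets(SAMPLE_CSV)
print(targets[0]["parameter"], "->", targets[0]["notes"])
```

Failing early on a missing column or an empty annotation is cheaper than discovering the problem after the literature search stage has already run.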

Section 05

Usage Methods

Coding Assistant Collaboration: Coding assistants such as Claude or Codex call MAPLE's tools via the Model Context Protocol (MCP) to run literature search, extraction, and YAML validation automatically; users remain responsible for reviewing the results.
Python API: Call the process_targets function to handle priors_csv and yaml files.
Batch Extraction: Process large-scale parameters in parallel stages; results of each stage are independently cached.
Best Practices: Annotation fields should include rate formulas and specific search terms (e.g., "MVD growth kinetics") to improve search efficiency.
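
Independent per-stage caching, as described for batch extraction, can be sketched as follows. This is an assumption-laden illustration rather than MAPLE's code: `run_stage`, the cache layout (one JSON file per stage and input hash), and the stand-in `fake_search` function are all hypothetical.

```python
import hashlib
import json
import tempfile
from pathlib import Path

def run_stage(cache_dir, stage_name, inputs, compute):
    """Run one pipeline stage, reusing a cached result when inputs are unchanged.

    Returns (result, was_cached). Each stage keeps its own subdirectory, so
    clearing one stage's cache does not invalidate the others.
    """
    payload = json.dumps(inputs, sort_keys=True).encode("utf-8")
    key = hashlib.sha256(payload).hexdigest()[:16]
    path = Path(cache_dir) / stage_name / f"{key}.json"
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8")), True
    result = compute(inputs)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(result), encoding="utf-8")
    return result, False

calls = []

def fake_search(inputs):
    """Stand-in for a literature search; records each real invocation."""
    calls.append(inputs["query"])
    return {"hits": [f"paper about {inputs['query']}"]}

with tempfile.TemporaryDirectory() as tmp:
    query = {"query": "MVD growth kinetics"}
    first, first_cached = run_stage(tmp, "search", query, fake_search)
    second, second_cached = run_stage(tmp, "search", query, fake_search)

print(first_cached, second_cached, len(calls))
```

Keying the cache on a hash of the stage inputs means a changed search annotation triggers a fresh search, while downstream stages whose inputs did not change can still reuse their results.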

Section 06

Application Scenarios and Value

Applicable to:

QSP model building for new drug development
Model recalibration (updating parameters with new data)
Cross-species/indication model transfer
Regulatory submission support (parameter traceability and uncertainty quantification)

MAPLE significantly lowers the barrier to QSP model calibration, enabling more teams to build high-quality models.

Section 07

Summary and Outlook

MAPLE uses LLMs as information extraction tools while leaving validation and interpretation to scientists, and it integrates literature extraction, statistical inference, and uncertainty quantification into a standardized pipeline. As LLM capabilities improve, such tools are likely to play a larger role in biomedical research.
