Reading

DPC: A Training-Free Dual-Paradigm Consistency Framework for Text-to-SQL Candidate Selection

DPC achieves SQL candidate selection without training by constructing a Minimal Distinction Database and cross-paradigm validation, with an accuracy improvement of up to 2.2%.

Text-to-SQL候选选择多智能体跨范式验证LLM评估代码生成

Published 2026-04-16 23:44Recent activity 2026-04-17 11:22Estimated read 7 min

Section 01

DPC: A Training-Free Dual-Paradigm Consistency Framework for Text-to-SQL Candidate Selection (Introduction)

DPC (Dual-Paradigm Consistency) is a training-free Text-to-SQL candidate selection framework that corely addresses the "generation-selection gap" problem of LLMs in Text-to-SQL tasks. It achieves an accuracy improvement of up to 2.2% by constructing a Minimal Distinction Database (MDD) and a cross-paradigm validation strategy. This thread will introduce its background, methods, experimental results, and application prospects in separate floors.

Section 02

The Generation-Selection Gap in Text-to-SQL and Dilemmas of Existing Solutions (Background)

The Generation-Selection Gap in Text-to-SQL

Although LLMs can generate SQL, they lack self-evaluation ability without an execution environment, leading to high Pass@K but low Pass@1 (only one can be selected in practical applications), forming the "generation-selection gap."

Dilemmas of Existing Solutions

Supervised Verifiers: Require large amounts of labeled data, prone to overfitting, and have poor generalization;
Self-Consistency: If the model has systematic biases, it is prone to "consensus hallucination";
LLM-as-a-Judge: Lack symbolic execution ability and are easily misled by surface features.

Section 03

Core Ideas of DPC and Multi-Agent Collaboration Architecture (Methods)

The core idea of DPC is to transform SQL selection from probabilistic guessing into deterministic verification with visible data. It adopts a three-agent collaboration architecture:

SLICER: Analyzes the structure of SQL candidates and identifies key differences;
TESTER: Constructs a Minimal Distinction Database (MDD) based on the differences;
SOLVER: Performs cross-paradigm validation (converts SQL to Python/Pandas) and compares the consistency of execution results.

Section 04

Innovative Design of Minimal Distinction Database and Cross-Paradigm Validation (Method Details)

Minimal Distinction Database (MDD)

MDD is a key innovation of DPC, with the following features:

Adversarial design: Constructed in a targeted manner to maximize candidate differences;
Fully observable: Data is visible and controllable, with transparent verification;
Minimal scale: Contains only the minimum data needed for distinction, reducing costs;
Logic-focused: Precisely exposes differences in boundary conditions, null value handling, etc.

Cross-Paradigm Validation

Implemented by converting SQL to Python/Pandas:

Paradigm independence: Declarative (SQL) and imperative (Python) syntax have different thinking patterns; if results are consistent, the credibility is high;
Bias offset: Different paradigms are less likely to have the same systematic biases;
Executable verification: Pandas provides a reliable execution foundation.

Section 05

Experimental Verification and Performance (Evidence)

Experimental Results

Evaluated on mainstream LLMs using BIRD and Spider benchmarks:

Accuracy improvement: 2.2% higher than baselines like Self-Consistency;
Consistency advantage: Stable across different models and query difficulties;
Training-agnostic: No training required, with strong generality.

Ablation Experiments

Necessity of MDD: Replacing MDD with random data leads to a significant performance drop;
Value of cross-paradigm: Single-paradigm verification has low accuracy;
Multi-agent collaboration: Individual agents cannot achieve the performance of the complete system.

Section 06

Technical Insights and Application Prospects (Conclusions and Recommendations)

Technical Insights

From guessing to verification: Constructing a controllable test environment is more reliable than relying on probabilistic outputs;
Cross-paradigm redundancy: Cross-validation reduces the risk of systematic biases;
Agent specialization: Decomposing tasks to specialized agents improves performance;
Interpretability: The verification process is transparent, and decision reasons can be traced.

Application Prospects

Code generation: Can be extended to other executable code scenarios;
Formal verification: Explore high-reliability fields by combining with symbolic execution;
Multi-agent systems: The collaboration architecture can be extended to more AI scenarios.

Section 07

Limitations and Future Outlook (Supplementary)

Limitations

MDD construction has computational overhead; complex queries may lead to state space explosion;
Cross-paradigm validation requires a reliable execution environment for the target language.

Conclusion

DPC provides a new idea for solving the Text-to-SQL generation-selection gap, and its method combining MDD and cross-paradigm validation is of great significance in building reliable intelligent systems.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

Folkering OS: When the Operating System Itself Is AI—A Self-Evolving Bare-Metal Rust System

Folkering OS is the world's first AI-native bare-metal operating system, entirely written in Rust no_std without relying on Linux, POSIX, or libc. It can generate commands from scratch, compile them into WASM, and run them in 10 seconds, achieving true self-evolution.

Recent activity 2026-04-09 16:15