# Agentic Delivery Playbook: A Specification-First Workflow for AI Programming Agents

> An engineering delivery model for AI programming agents that reduces agent drift and development costs while enhancing the auditability and reliability of code delivery through specification-first practices, role separation, and model routing.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-06-04T09:15:36.000Z
- 最近活动: 2026-06-04T09:19:59.465Z
- 热度: 150.9
- 关键词: AI编程代理, 规范优先, 模型路由, 软件工程, 代码审查, 代理漂移, 人机协作, 开发流程
- 页面链接: https://www.zingnex.cn/en/forum/thread/agentic-delivery-playbook-ai
- Canonical: https://www.zingnex.cn/forum/thread/agentic-delivery-playbook-ai
- Markdown 来源: floors_fallback

---

## Agentic Delivery Playbook: Introduction to the Specification-First Workflow for AI Programming Agents

### Agentic Delivery Playbook: A Specification-First Workflow for AI Programming Agents
**Abstract**: An engineering delivery model for AI programming agents that reduces agent drift and development costs while enhancing the auditability and reliability of code delivery through specification-first practices, role separation, and model routing.
**Keywords**: AI programming agents, specification-first, model routing, software engineering, code review, agent drift, human-AI collaboration, development workflow
**Source**: Original author/maintainer: arcayne, published on GitHub (link: https://github.com/arcayne/agentic-delivery-playbook), release date: June 4, 2026.

Core Insights: Addressing pain points in AI programming agent applications such as agent drift, cost overruns, and lack of auditability, this playbook proposes systematic solutions including specification-first delivery cycles, role separation with approval checkpoints, and model routing strategies to establish reproducible, measurable, and improvable AI-assisted development engineering practices.

## Background: Engineering Challenges of AI Programming Agents

With the rapid advancement of large language model capabilities, AI programming agents have become important auxiliary tools for software development, but they face three core challenges:
1. **Agent Drift**: Implementation processes gradually deviate from original intentions under ambiguous requirements, especially in complex multi-file modification scenarios;
2. **Cost Overruns**: Using the same strongest model for all tasks leads to high API call costs;
3. **Lack of Auditability**: No complete record of decision chains makes it difficult to trace and review issues.

The Agentic Delivery Playbook is a systematic solution targeting these pain points.

## Core Philosophy and Role Separation

### Core Philosophy
Core Idea: Use the right model for the right task at the right time.
The delivery cycle consists of 8 stages: `intake -> spec -> critique -> approval -> implementation -> QA -> fix/escalate -> closeout`, with clear responsibility boundaries and delivery standards for each stage.

### Role Separation
Explicitly define 5 key roles (can be assumed by models/agents/humans):
- **Specification Writer**: Translates ambiguous requirements into precise technical specifications (interfaces, data flows, boundary conditions, etc.), requiring strong reasoning capabilities;
- **Critic**: Adversarially reviews specifications to identify missing boundaries and security vulnerabilities;
- **Implementer**: Executes coding according to approved specifications, can use low-cost models;
- **QA Auditor**: Validates implementation results against specifications;
- **Human Approver**: Makes final decisions at key nodes (security/privacy, etc.).

Role separation enables auditability and forms a complete delivery archive.

## Model Routing Strategy

Select appropriate models based on task characteristics instead of using the strongest configuration for all tasks:
- **High Ambiguity Tasks** (requirements analysis, architecture trade-offs, security reviews): Prioritize strong reasoning models (e.g., GPT-4, Claude 3 Opus);
- **Structured Implementation Tasks** (coding after clear specifications): Use low-cost models (e.g., GPT-4o-mini, Claude 3 Haiku);
- **Quality Audit Tasks**: Use models with good instruction-following ability to strictly validate against acceptance criteria.

It is recommended to record expected and actual model configurations, analyze correlations between cost, quality, and drift to optimize strategies.

## Applicable Scenarios and Tool Integration

### Applicable Scenarios
Recommended use cases: Cross-boundary changes, ambiguous requirements, security/privacy risks, customer feature changes, drift-prone tasks.
Full process can be skipped (must meet all conditions): Clear requirements, affects 1-2 files, simple validation, no risks, no specification-first requirements.

### Tool Integration
- Provides Markdown specification templates, HTML visualization templates, runtime configuration JSON templates, QA checklists;
- Pi coding agent adapter supporting general model configurations;
- Unified runtime directory naming: `specs/YYYYMMDD-HHMM-feature-slug/`, containing complete delivery archives.

## Implications for Development Teams

- **Technical Leads**: Control AI agent risks, balance efficiency and code quality;
- **Development Teams**: Clarify human-AI collaboration boundaries, reduce anxiety about AI dependency;
- **Organizational Governance**: Delivery archives provide basic materials for compliance audits, proving code has undergone review and validation.

## Summary and Outlook

The Agentic Delivery Playbook represents the evolution of AI-assisted development from "prompt engineering" to "systems engineering", establishing a checkable, auditable, and optimizable delivery system.

Current version v0.2.0 covers evidence integrity, budget awareness, and ROI evaluation guidance. Future directions:
- Deep integration with IDEs and CI/CD systems;
- Automated model routing optimization based on historical data;
- Specialized templates for specific tech stacks.

Provides a starting point for teams using AI programming agents at scale to balance efficiency and quality.