AgentLog: An Agent Workflow Auditing and Replay Tool Based on OTel GenAI Semantic Specifications

An open-source, self-hostable agent workflow auditing tool that adheres to OpenTelemetry GenAI semantic specifications. It provides scanning, replay, and auditing capabilities, supports MCP protocol integration with mainstream AI development environments, and is suitable for AI security governance in CI/CD pipelines.

Tags: Agent, OpenTelemetry, OTel, GenAI, Observability, Auditing, MCP, CI/CD, Security Governance

Published 2026-06-09 01:15 | Recent activity 2026-06-09 01:20 | Estimated read 7 min

Section 01

Introduction to AgentLog: An Agent Workflow Auditing and Replay Tool Based on OTel GenAI Semantic Specifications

This article introduces AgentLog, an open-source, self-hostable tool for auditing and replaying agent workflows according to the OpenTelemetry GenAI semantic specifications. The original author is cognis-digital, and the project is open-sourced on GitHub (link: https://github.com/cognis-digital/agentlog), released on 2026-06-08.

Section 02

Observability Challenges of Agent Workflows (Background)

As LLM and agent technologies mature, enterprises that integrate AI agents face four governance challenges: the black box problem (the decision-making process is opaque), security risks (agents may access sensitive data or perform dangerous operations), compliance requirements (AI behavior must be auditable), and difficult troubleshooting (debugging tools are lacking). Traditional APM tools were designed for deterministic systems and adapt poorly to non-deterministic systems such as agents, so a specialized solution is needed.

Section 03

Core Features and Technical Architecture of AgentLog

Core Features: 1. Workflow scanning (CLI commands such as agentlog scan ., with JSON/SARIF output and the ability to gate CI builds); 2. Multi-format reports (tables, JSON, SARIF); 3. Native MCP support (compatible with Claude Desktop, Cursor, etc.); 4. Cross-platform and multi-language (runs on Linux/macOS/Windows/Docker, with Python/JS/Go/Rust versions).

Technical Architecture: a modular pipeline in which data flows Input → Collector → Rule Engine/Analyzer → Scorer → Report Generator. The Report Generator emits tables, JSON, SARIF, or MCP tool output.
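The staged design described above can be illustrated with a minimal Python sketch. Every name here (Finding, collect, analyze, score, report, the toy rule, the severity weights, and the record format) is hypothetical and chosen only for illustration; none of it is taken from AgentLog's source code.

```python
import json
from dataclasses import dataclass, asdict

# Hypothetical names throughout: this mirrors the pipeline stages only,
# it is not AgentLog's actual implementation.

@dataclass
class Finding:
    rule_id: str
    severity: str  # "low", "medium", or "high"
    message: str

# Assumed weights used by the toy scorer.
SEVERITY_WEIGHT = {"low": 1, "medium": 3, "high": 10}

def collect(records):
    """Collector: keep only records that describe tool calls."""
    return [r for r in records if r.get("type") == "tool_call"]

def analyze(events):
    """Rule engine: a single toy rule that flags shell execution."""
    findings = []
    for event in events:
        if event.get("tool") == "shell":
            findings.append(
                Finding("toy-shell-exec", "high",
                        f"shell command invoked: {event.get('input', '')}")
            )
    return findings

def score(findings):
    """Scorer: sum the severity weights of all findings."""
    return sum(SEVERITY_WEIGHT[f.severity] for f in findings)

def report(findings, total):
    """Report generator: serialize the result as JSON."""
    return json.dumps(
        {"score": total, "findings": [asdict(f) for f in findings]},
        indent=2,
    )

def run_pipeline(records):
    """Input -> Collector -> Rule Engine -> Scorer -> Report Generator."""
    findings = analyze(collect(records))
    return report(findings, score(findings))

if __name__ == "__main__":
    sample = [
        {"type": "tool_call", "tool": "shell", "input": "rm -rf /tmp/x"},
        {"type": "message", "text": "hello"},
        {"type": "tool_call", "tool": "search", "input": "weather"},
    ]
    print(run_pipeline(sample))
```

Keeping each stage a plain function means a new output format only requires another report function, and a new check only requires another rule inside the analyzer.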

Section 04

OTel GenAI Semantic Specification Support (Technical Highlight)

AgentLog fully supports the OpenTelemetry GenAI semantic specification, which provides a standardized telemetry model for generative AI applications.

Key Semantic Attributes: gen_ai.system (AI system identifier), gen_ai.request.model (model requested), gen_ai.response.model (model that produced the response), gen_ai.usage.input_tokens and gen_ai.usage.output_tokens (token usage), gen_ai.response.finish_reasons (reasons the model stopped generating).

Event Types: gen_ai.content.prompt (user prompt), gen_ai.content.completion (model completion content). Because the telemetry follows this standard, it can be sent to any OTel-compatible observability platform, which avoids vendor lock-in.
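To show what standardized attributes make possible, the following sketch aggregates token usage per model from span attribute dictionaries. The attribute keys are the OTel GenAI names for the request model and token usage; the function name summarize_token_usage and the sample spans are assumptions made for illustration, and the code does not depend on AgentLog or on the OpenTelemetry SDK.

```python
from collections import defaultdict

def summarize_token_usage(spans):
    """Sum input and output tokens per requested model.

    `spans` is a list of plain dicts holding span attributes keyed by
    OTel GenAI attribute names. Missing attributes count as zero.
    """
    totals = defaultdict(lambda: {"input_tokens": 0, "output_tokens": 0})
    for attrs in spans:
        model = attrs.get("gen_ai.request.model", "unknown")
        totals[model]["input_tokens"] += attrs.get("gen_ai.usage.input_tokens", 0)
        totals[model]["output_tokens"] += attrs.get("gen_ai.usage.output_tokens", 0)
    return dict(totals)

if __name__ == "__main__":
    # Made-up sample data; model names are placeholders.
    sample_spans = [
        {"gen_ai.request.model": "model-a",
         "gen_ai.usage.input_tokens": 120,
         "gen_ai.usage.output_tokens": 40},
        {"gen_ai.request.model": "model-a",
         "gen_ai.usage.input_tokens": 80,
         "gen_ai.usage.output_tokens": 10},
        {"gen_ai.request.model": "model-b",
         "gen_ai.usage.input_tokens": 5},
    ]
    print(summarize_token_usage(sample_spans))
```

Because the keys are vendor-neutral, the same aggregation works regardless of which instrumented framework or provider produced the spans.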

Section 05

Application Scenarios and Ecosystem Integration

Application Scenarios: 1. CI/CD integration (e.g., running agentlog scan . --fail-on high in GitHub Actions); 2. AI security governance (identifying sensitive data leaks, dangerous operations, unauthorized model access, and prompt injection vulnerabilities); 3. Compliance auditing (supporting GDPR and finance/healthcare industry requirements); 4. Development debugging (replaying the execution process to locate issues).
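The CI gating idea can also be applied to any SARIF report with a few lines of Python. The sketch below reads a SARIF 2.1.0 document and returns a non-zero exit code when blocking results are present. The function name sarif_gate is hypothetical, and treating the SARIF level "error" as the equivalent of AgentLog's "high" severity is an assumption, since the source does not describe how severities map to SARIF levels.

```python
import json

def sarif_gate(sarif_text, blocking_levels=("error",)):
    """Return (exit_code, blocking_count) for a SARIF 2.1.0 document.

    exit_code is 1 when at least one result has a blocking level, else 0.
    """
    doc = json.loads(sarif_text)
    blocking = 0
    for run in doc.get("runs", []):
        for result in run.get("results", []):
            # Simplification: a result without an explicit level is
            # treated as "warning" here.
            if result.get("level", "warning") in blocking_levels:
                blocking += 1
    return (1 if blocking else 0), blocking

if __name__ == "__main__":
    # Made-up report with one blocking and one non-blocking result.
    sample = json.dumps({
        "version": "2.1.0",
        "runs": [{
            "tool": {"driver": {"name": "example-scanner"}},
            "results": [
                {"ruleId": "R1", "level": "error",
                 "message": {"text": "dangerous operation"}},
                {"ruleId": "R2", "level": "note",
                 "message": {"text": "informational"}},
            ],
        }],
    })
    code, count = sarif_gate(sample)
    print(f"blocking results: {count}, exit code: {code}")
```

In a pipeline, a script like this would end with sys.exit(code) so that the job fails whenever blocking findings exist.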

Ecosystem Integration: Supports MCP servers, OpenAI-compatible APIs, and frameworks like LangChain/CrewAI/AutoGen/LlamaIndex.

Competitor Comparison: AgentLog's stated advantages are that it is self-hostable (no account required), runs with a single command and zero configuration, outputs JSON and SARIF for CI, supports MCP natively, is ported to multiple languages, and uses an open license (COCL).

Section 06

Deployment Methods and Future Outlook

Deployment Methods: 1. Package manager (pip install cognis-agentlog); 2. Docker (docker run ghcr.io/cognis-digital/agentlog); 3. Platform scripts (dedicated scripts for Linux/macOS/Windows); 4. Cloud platforms (AWS/Azure/GCP/k8s, refer to DEPLOY.md).

Limitations: The current version focuses on static code scanning.

Future Outlook: runtime monitoring, distributed tracing (for multi-agent collaboration), behavior baselines (normal baselines established from historical data), and auto-fix (suggesting or automatically applying fixes).

Section 07

Conclusion

AgentLog combines the OTel GenAI specifications with the MCP protocol to give enterprises a self-hostable tool for agent governance. As AI agents become core infrastructure, tools of this kind help keep AI systems secure, trustworthy, and compliant. Teams that build or operate AI agent systems may find AgentLog worth evaluating, keeping in mind that the current version focuses on static code scanning.
