Zing Forum


llm-metadata: Lightweight LLM Metadata Static API, Making Model Discovery Simple

A static API project that serves LLM metadata queries via GitHub Pages, supporting multilingual localization, NewAPI-format output, high throughput, and fully serverless deployment.

Tags: LLM Metadata · Static API · GitHub Pages · Multilingual · Model Discovery · NewAPI · CDN · Zero-Ops
Published 2026-03-29 15:14 · Recent activity 2026-03-29 15:22 · Estimated read: 7 min

Section 01

llm-metadata: Lightweight LLM Metadata Static API, Making Model Discovery Simple

In AI application development, LLM metadata (model names, context lengths, feature support, pricing, and so on) is scattered across providers' documentation, with inconsistent formats and frequent updates. The llm-metadata project provides a fully static API service, deployable via GitHub Pages or Cloudflare Pages, offering zero operations cost, high throughput, multilingual localization, and NewAPI-format compatibility, making LLM metadata acquisition simple and reliable.


Section 02

Project Background: Pain Points in LLM Metadata Acquisition

In AI application development, accurately obtaining LLM metadata is a common need, but this data is scattered across different providers' documentation, with inconsistent formats and frequent updates, creating considerable friction for developers. The llm-metadata project was created to solve this problem.


Section 03

Core Approach: Static-First Design Philosophy

llm-metadata takes "static-first" as its core philosophy: services are provided through pre-built static JSON files, which can be deployed on GitHub Pages or Cloudflare Pages. This design brings four major advantages: zero operations cost (no servers or databases to maintain), extremely high performance (global CDN caching, millisecond-level responses), unlimited scalability (no concurrency limits), and free hosting (within the platforms' free quotas). When the data changes, only the static files need to be regenerated; the service itself is unaffected.
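Because the "API" is just pre-built files on a CDN, a client is nothing more than a GET against a file path. A minimal sketch, assuming a hypothetical index file at api/models.json (the real file layout may differ; check the project's docs):

```javascript
// Minimal sketch of a client for a static JSON API hosted on a CDN.
// The endpoint path "api/models.json" is an illustrative assumption,
// not necessarily the project's actual file layout.
const BASE_URL = "https://basellm.github.io/llm-metadata/";

function modelIndexUrl(base = BASE_URL) {
  // Static hosting means the "API" is just a file path under the site root.
  return new URL("api/models.json", base).toString();
}

async function fetchModelIndex(base = BASE_URL) {
  // A plain GET against the CDN; no auth, no server-side compute.
  const res = await fetch(modelIndexUrl(base));
  if (!res.ok) throw new Error(`HTTP ${res.status}`);
  return res.json();
}
```

Swapping the base URL for the Cloudflare Pages mirror (https://llm-metadata.pages.dev/) requires no other change, which is the practical payoff of the static-first design.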


Section 04

Data Coverage and Key Features

- Data Sources: Integrates models.dev/api.json and community contributions (corrections and supplements are accepted via the data/overrides directory).
- Covered Information: Basic model information, capability tags (tool calling, open-source weights, etc.), modality support (text/image/audio), context limits, pricing, and provider configurations.
- Multilingual Support: Structured translation management. Adding a new language takes only three steps: update locales.json, create the translation files, and run the build. After building, the localized API is located under the dist/api/i18n// directory.
- NewAPI Compatibility: Provides NewAPI-format output, so existing toolchains can consume it directly.
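To make the coverage concrete, here is a hypothetical sketch of what one metadata record might look like and how a consumer could filter on it. All field names (capabilities, contextWindow, pricing, etc.) are assumptions based on the categories listed above, not the project's actual schema:

```javascript
// Hypothetical model metadata records; field names are illustrative
// assumptions, not the project's actual schema.
const models = [
  {
    id: "example/model-a",
    capabilities: { toolCalling: true, openWeights: false },
    modalities: ["text", "image"],
    contextWindow: 128000,
    pricing: { inputPerMTok: 3.0, outputPerMTok: 15.0 },
  },
  {
    id: "example/model-b",
    capabilities: { toolCalling: false, openWeights: true },
    modalities: ["text"],
    contextWindow: 32000,
    pricing: { inputPerMTok: 0.5, outputPerMTok: 1.5 },
  },
];

// Pick models that support tool calling and meet a minimum context size.
function withToolCalling(list, minContext = 0) {
  return list.filter(
    (m) => m.capabilities.toolCalling && m.contextWindow >= minContext
  );
}

console.log(withToolCalling(models, 64000).map((m) => m.id)); // → ["example/model-a"]
```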


Section 05

Usage Guide: Hosting and Customization

- Direct Use: Access the hosted versions on GitHub Pages (https://basellm.github.io/llm-metadata/) or Cloudflare Pages (https://llm-metadata.pages.dev/).
- Local Build: Install dependencies (npm install), run the build (npm run build), force a rebuild (npm run build:force), or run the CI check (npm run check).
- Custom Overrides: Customize data via the data/overrides directory. Override files use a deep-merge strategy, supporting field-level modifications at both the model and provider levels (e.g., id, name, description, pricing).
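The deep-merge behavior described above can be sketched as follows: override values win, nested objects are merged recursively, and untouched upstream fields survive. This is an illustrative reimplementation of the idea, not the project's actual merge code:

```javascript
// Sketch of a deep-merge strategy for override files: recurse into nested
// objects, replace scalars and arrays wholesale, keep untouched base fields.
function deepMerge(base, override) {
  const out = { ...base };
  for (const [key, value] of Object.entries(override)) {
    const existing = out[key];
    if (
      value && typeof value === "object" && !Array.isArray(value) &&
      existing && typeof existing === "object" && !Array.isArray(existing)
    ) {
      out[key] = deepMerge(existing, value); // merge nested objects
    } else {
      out[key] = value; // override wins for scalars and arrays
    }
  }
  return out;
}

const upstream = { id: "m1", name: "Model One", pricing: { input: 1, output: 2 } };
const override = { pricing: { output: 3 }, description: "corrected entry" };
const merged = deepMerge(upstream, override);
// merged keeps pricing.input from upstream, takes pricing.output and
// description from the override, and leaves id and name untouched.
```

The point of this strategy is that an override file only needs to state the fields it corrects; everything else continues to track the upstream data source.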


Section 06

Automation and Application Scenarios

Automated CI/CD: Implemented with GitHub Actions. Triggers include pushes to relevant directories, manual dispatch, and a scheduled run every 6 hours. The incremental update strategy is controlled by data/policy.json; when auto=false, manually customized content is protected from being overwritten. Application Scenarios:
1. Model selector UI (filter by capability/price/context);
2. Cost estimation tools (pre-calculate usage costs);
3. Capability detection (avoid runtime errors);
4. Documentation generation (automatically sync the latest data).
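Scenario 2 above (cost estimation) reduces to simple arithmetic once pricing metadata is available. A minimal sketch, where the per-million-token field names are hypothetical, not the project's actual schema:

```javascript
// Sketch of a cost estimator driven by pricing metadata. The field names
// inputPerMTok/outputPerMTok are illustrative assumptions.
function estimateCostUSD(pricing, inputTokens, outputTokens) {
  return (
    (inputTokens / 1_000_000) * pricing.inputPerMTok +
    (outputTokens / 1_000_000) * pricing.outputPerMTok
  );
}

const pricing = { inputPerMTok: 3.0, outputPerMTok: 15.0 };
// 10k prompt tokens + 2k completion tokens:
console.log(estimateCostUSD(pricing, 10_000, 2_000)); // ≈ 0.06 USD
```

Because the metadata is static and CDN-cached, such an estimator can run entirely client-side, with no backend of its own.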


Section 07

Architectural Advantages and Industry Insights

Architectural Advantages: llm-metadata demonstrates an effective model of "static generation + CDN distribution + incremental updates + community-driven data", well suited to read-heavy scenarios with low-frequency data changes (such as LLM metadata). Industry Insights: As the AI ecosystem develops, similar static-first services may emerge, such as prompt template libraries, evaluation benchmark results, and model performance comparisons. These services all need high availability, low latency, and zero operations overhead, and llm-metadata provides a replicable model.


Section 08

Summary: A Concise and Practical Metadata Solution

llm-metadata is a project with a concise design and practical functionality. It solves the pain points of LLM metadata acquisition with a static API, embodying the power of the "static-first" philosophy. For developers, it provides a reliable data source; for engineers, it is a case study in solving a practical problem with minimal complexity.