Reading

Building a Node.js AI Agent from Scratch: A Practical Analysis of Tool Calling Based on the ReAct Pattern

This article provides an in-depth analysis of a command-line AI Agent project built with Node.js and the OpenAI API. It details the structured reasoning cycle (ReAct pattern) of PLAN→ACTION→OBSERVATION→OUTPUT it adopts, as well as how to integrate external tools like weather queries via function calls, offering developers a clear introductory example to understand the AI Agent architecture.

AI AgentNode.jsOpenAIReAct模式工具调用函数调用大语言模型智能代理命令行应用JavaScript

Published 2026-05-31 13:12Recent activity 2026-05-31 13:19Estimated read 6 min

Section 01

[Introduction] Building a Node.js AI Agent from Scratch: A Practical Analysis of Tool Calling Based on the ReAct Pattern

This article analyzes a command-line AI Agent project built with Node.js and the OpenAI API. It corely uses the ReAct pattern (PLAN→ACTION→OBSERVATION→OUTPUT cycle) to implement tool calls (e.g., weather queries), providing developers with a clear introductory example to understand the AI Agent architecture. The project is from GitHub user shah-harshil-07's base-ai-agent-app (released on May 31, 2026).

Section 02

Project Background and Core Concepts

An AI Agent is a bridge connecting large language models to the real world. This project is a lightweight command-line AI Agent based on Node.js and the OpenAI API. Its core value lies in its clear architecture—it adopts the ReAct (Reasoning+Acting) pattern, broken down into four stages: planning, execution, observation, and output. It is easy to understand and maintain, making it an ideal starting point for learning AI Agent development. The ReAct pattern endows the model with reasoning, action, and observation feedback capabilities to handle complex multi-step tasks.

Section 03

Detailed Explanation of the Four-Stage Reasoning Cycle of the ReAct Pattern

The project strictly follows the PLAN→ACTION→OBSERVATION→OUTPUT cycle:

PLAN: Analyze the user's intent and determine whether a tool is needed (e.g., decide to call the weather tool if asking about Mumbai's weather);
ACTION: Generate a structured function call (e.g., extract "Mumbai" as a parameter for a JSON request);
OBSERVATION: Receive the tool's return result (e.g., weather API data) and integrate it into the context;
OUTPUT: Generate a natural language answer based on all information (e.g., details of Mumbai's weather).

Section 04

Analysis of Technical Implementation Architecture

Technical Selection and Design:

Node.js: Asynchronous non-blocking I/O is suitable for network requests; the JS/TS ecosystem is rich, and it is lightweight, making it ideal for command-line use;
OpenAI API: Leverages function calling capabilities to automatically identify tool call needs and generate JSON parameters, simplifying development;
Tool Function Design: Taking weather queries as an example, it requires a clear name (get_weather), explicit parameters (city name), error handling, and structured returns, which can be extended to tools like search engines.

Section 05

Code Structure and Implementation Key Points

Core Main Loop: Receive input → Build context (system prompt + tool definitions + conversation history) → Call OpenAI API → Determine response type (answer/tool call) → Execute tool → Feedback result → Output answer. Prompt Engineering Strategy: Clearly define the Agent's role, tool scenarios, and output format, emphasizing step-by-step reasoning. State Management: Maintain conversation history, tool call records, and session metadata (in-memory or persisted).

Section 06

Scalability and Application Scenarios

Expansion Directions:

Tool Ecosystem: Add search engines, calculation tools, database access, etc. (only need to implement functions + define parameters + register + update prompts);
Multi-Agent Collaboration: Agents from different fields (data analysis, code generation) collaborate via message passing;
Practical Scenarios: Intelligent customer service, data analysis assistants, development aids, personal assistants, etc.

Section 07

Development Practice Recommendations

Practice Key Points:

Error Handling: API retries, invalid input clarification, timeout control, loop detection;
Cost Control: Context compression, caching, model selection (GPT-3.5/GPT-4), tool call optimization;
Security and Privacy: Input validation to prevent injection, permission control, data desensitization, audit logs.

Section 08

Summary and Outlook

This project demonstrates the core architecture of an AI Agent with concise code, transforming LLMs from text generation to intelligent agents via the ReAct pattern. Mastering Agent development will become a core skill for AI developers. Future Trends: Multi-modal perception, long-term memory, self-improving Agent systems, further blurring the boundary between humans and machines.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

Folkering OS: When the Operating System Itself Is AI—A Self-Evolving Bare-Metal Rust System

Folkering OS is the world's first AI-native bare-metal operating system, entirely written in Rust no_std without relying on Linux, POSIX, or libc. It can generate commands from scratch, compile them into WASM, and run them in 10 seconds, achieving true self-evolution.

Recent activity 2026-04-09 16:15