ExTrm: A Journey of Exploring Experimental Miniature Reasoning Models Built with Elixir

ExTrm is an experimental reasoning model project written in Elixir. It explores two architectural directions, grid reasoning and text/code reasoning, and offers a lightweight experimental platform for research-oriented AI development.

Elixir, reasoning models, Nx, ARC, grid reasoning, code generation, experimental AI, functional programming

Published 2026-05-19 07:56 | Recent activity 2026-05-19 08:23 | Estimated read 7 min

Section 01

ExTrm: Exploring Experimental Miniature Reasoning Models Based on Elixir (Introduction)

ExTrm is an experimental miniature reasoning model project written entirely in Elixir. It aims to give research-oriented AI development a lightweight experimental platform, and it explores two core directions: grid reasoning for ARC-style tasks, and text/code reasoning with long-context support and code generation. Nx serves as the tensor computation backend. The project is positioned as a research codebase rather than a production-grade framework, with the focus on rapid iteration to validate architectural ideas.

Section 02

Project Background and Positioning

Most reasoning models are built in the Python ecosystem, but Elixir's concurrency model and functional programming features make it an alternative worth exploring. ExTrm is not a mature production framework. It is a research codebase meant for rapid iteration, trial and error, and validation of new architectures. The code is intentionally kept rough so that idea validation comes first, which makes it a fit for developers willing to experiment hands-on.

Section 03

Research Direction 1: Grid Reasoning Model

Grid reasoning was the initial core of the project. It targets ARC (Abstraction and Reasoning Corpus) style tasks, which require pattern recognition and reasoning over colored grids. Key architectural elements include a recursive block structure (to support multi-step reasoning), repeated thinking steps (to simulate human reasoning), and a colored grid representation. The default configuration has roughly 100 million parameters, so running on a CPU requires passing a smaller configuration.
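
The two ideas named above, a colored grid representation and a block applied over repeated thinking steps, can be sketched in plain Elixir. This is a minimal illustration under stated assumptions, not ExTrm's code: the module GridSketch, its function names, and the toy step function are all hypothetical, and the real model would apply a learned Nx layer where the toy step is.

```elixir
defmodule GridSketch do
  @moduledoc "Hypothetical illustration; not ExTrm's actual API."

  # An ARC-style grid is modeled as rows of integer color codes.
  # Flattening turns it into one sequence of cells, row by row.
  def flatten(grid), do: List.flatten(grid)

  # Rebuild rows of `width` cells from a flat sequence.
  def unflatten(cells, width), do: Enum.chunk_every(cells, width)

  # Apply the same step function `steps` times to the state.
  # This mirrors the "repeated thinking steps" idea: one block, reused.
  def think(state, step_fun, steps) when is_integer(steps) and steps >= 0 do
    Enum.reduce(1..steps//1, state, fn _i, acc -> step_fun.(acc) end)
  end
end

# Toy step (an assumption for the demo): shift every color by one, modulo 10.
step = fn cells -> Enum.map(cells, &rem(&1 + 1, 10)) end

[[0, 1], [2, 3]]
|> GridSketch.flatten()
|> GridSketch.think(step, 3)
|> GridSketch.unflatten(2)
|> IO.inspect()
# prints [[3, 4], [5, 6]]
```

Reusing one step function keeps the parameter count independent of the number of thinking steps, which is presumably part of the appeal of a recursive block for a miniature model.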

Section 04

Research Direction 2: Text/Code Reasoning Model

The text/code reasoning direction is the more practical of the two. Its components include a byte-level tokenizer (no pre-trained vocabulary needed), Hugging Face dataset integration, a long-context text model foundation, and a complete training, saving, and inference pipeline. The project trains on the karti06k/Qwen-59k-Python-Instruct dataset, which contains instructions, reasoning processes, and code, and therefore suits code generation models.
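
To show why a byte-level tokenizer needs no pre-trained vocabulary, here is a minimal sketch in plain Elixir. The module name ByteTokenizerSketch is hypothetical and this is not ExTrm's tokenizer; in particular, any special tokens the project may reserve (padding, end of sequence) are not modeled.

```elixir
defmodule ByteTokenizerSketch do
  @moduledoc "Hypothetical illustration; not ExTrm's actual tokenizer."

  # Each UTF-8 byte becomes one token id in 0..255, so the "vocabulary"
  # is fixed in advance and nothing has to be learned or downloaded.
  def encode(text) when is_binary(text), do: :binary.bin_to_list(text)

  # Inverse of encode/1: turn a list of byte values back into a binary.
  def decode(ids) when is_list(ids), do: :binary.list_to_bin(ids)
end

ids = ByteTokenizerSketch.encode("héllo")
IO.inspect(ids)
# prints [104, 195, 169, 108, 108, 111]

IO.puts(ByteTokenizerSketch.decode(ids))
# prints héllo
```

The accented character takes two bytes, so five characters become six tokens. That is the usual trade-off of byte-level tokenization: sequences get longer, which is one reason long-context support matters for this direction.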

Section 05

Highlights of Technical Architecture

Long Context Support: The text model supports a 128K context length at the architectural level, with configuration parameters such as context_length: 131072 and attention_window: 2048. It uses techniques such as chunked prefill, sliding attention, RoPE-style positional encoding, and memory tokens, but its actual capabilities still need to be verified through training.

Nx Backend: ExTrm uses Nx (Numerical Elixir) as the tensor computation backend. Nx provides a NumPy-like API while keeping a functional style, which makes deep learning experiments approachable for Elixir developers.
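
To make the relationship between the context length and the attention window concrete, the following plain-Elixir sketch computes which positions a token can attend to under causal sliding-window attention. It is an illustration under assumptions: the module SlidingWindowSketch and its functions are hypothetical, the article does not say how ExTrm implements its window, and memory tokens are not modeled.

```elixir
defmodule SlidingWindowSketch do
  @moduledoc "Hypothetical illustration; not ExTrm's actual attention code."

  # Key positions visible to the query at `pos`: the position itself
  # plus up to `window - 1` positions before it (never future ones).
  def visible(pos, window)
      when is_integer(pos) and pos >= 0 and is_integer(window) and window > 0 do
    max(pos - window + 1, 0)..pos
  end

  # Boolean mask as nested lists: one row per query, one column per key.
  def mask(seq_len, window) do
    for q <- 0..(seq_len - 1)//1 do
      range = visible(q, window)
      for k <- 0..(seq_len - 1)//1, do: k in range
    end
  end

  # Total number of query/key pairs that are actually scored.
  def attended_pairs(seq_len, window) do
    Enum.reduce(0..(seq_len - 1)//1, 0, fn pos, acc ->
      acc + min(pos + 1, window)
    end)
  end
end

IO.inspect(SlidingWindowSketch.mask(4, 2))
# prints
# [
#   [true, false, false, false],
#   [true, true, false, false],
#   [false, true, true, false],
#   [false, false, true, true]
# ]

IO.inspect(SlidingWindowSketch.attended_pairs(131_072, 2_048))
# prints 266339328
```

With the values quoted above, full causal attention would score 131072 × 131073 / 2 = 8,590,000,128 pairs, so a 2048-token window cuts the work by a factor of roughly 32. The cost is that information from further back must travel through other mechanisms, which is presumably the role of the memory tokens.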

Section 06

Quick Start Guide and Code Structure

Code Structure: Modules are clearly divided. The core of the grid model is in lib/ex_trm/model.ex and the core of the text model is in lib/ex_trm/text/model.ex. Separate modules cover dataset loading, training, inference, and the command-line tools.

Quick Start:

Environment preparation: run mix deps.get, then mix test
Small-scale grid experiment: pass a small configuration (e.g., vocab_size: 8, d_model: 16, etc.)
Text model operations: download the dataset (mix ex_trm.text.dataset.download), run a small-scale training job (mix ex_trm.text.train --rows 64 --steps 10), then test inference (mix ex_trm.text.generate --prompt ...)

Section 07

Target Audience and Usage Suggestions

ExTrm is suitable for: AI enthusiasts in the Elixir ecosystem, model architecture researchers (for rapid idea validation), educational purposes (learning reasoning model mechanisms), and ARC task researchers. It is not suitable for developers expecting an out-of-the-box production framework. It is recommended to treat it as a laboratory workbench to freely experiment with architectural ideas without pursuing a perfect API.

Section 08

Conclusion: The Democratization Direction of AI Development

ExTrm shows that AI development does not have to be limited to the Python ecosystem. Elixir's concurrency features and fault-tolerant design are, in theory, a good match for distributed training. Although the project is at an early experimental stage, it gives the Elixir community a useful starting point. As the author said: "Some ideas may stay, some may get deleted, some may become separate models. That's fine." That open, experimental mindset is what makes the project worth following.
