Reading

Large Language Models Combined with Genetic Programming: Automatically Discovering Interpretable Kalman Filter Variants

大语言模型遗传编程卡尔曼滤波器符号回归算法发现可解释AI进化计算状态估计DeepSeekFunSearch

Published 2026-05-18 10:43Recent activity 2026-05-18 10:49Estimated read 7 min

Section 01

[Introduction] Large Language Models Combined with Genetic Programming: Automatically Discovering Interpretable Kalman Filter Variants

A research team from ETH Zurich proposed a new method combining large language models (LLMs) and Cartesian Genetic Programming, which can automatically discover and optimize Kalman filtering algorithms from raw data. This method outperforms traditional Kalman filters in adversarial environments, and the generated symbolic expressions are interpretable, providing a new paradigm for the automatic improvement of classic algorithms.

Section 02

Research Background and Motivation

Since 1960, the Kalman filter has been a cornerstone in the field of state estimation, but it relies on precise system modeling and is limited when facing challenges such as model mismatch, nonlinear systems, and non-Gaussian noise. In recent years, studies like FunSearch have shown that combining LLMs with evolutionary algorithms can automatically discover complex mathematical structures and algorithms, inspiring the team to explore applying this paradigm to Kalman filters. The core question is: Can AI automatically rediscover and improve the Kalman filter?

Section 03

Core Methodology

Two complementary evolutionary search strategies are adopted:

Cartesian Genetic Programming (CGP)：A graph-based variant of genetic programming that uses a fixed-size genotype grid to express state update equations, generating compact and interpretable symbolic expressions.
LLM-assisted FunSearch-style Mutation：Integrates the DeepSeek-R1-Distill-Qwen-14B model to generate semantically meaningful code variants based on the best programs in the population, combining LLM code understanding and evolutionary exploration capabilities. Experimental setup: 4×NVIDIA H100 GPUs, 64-core CPU node, each experiment takes about 3 days, maximum model generation length is 3024 tokens, and 4 independent evolutionary islands run in parallel.

Section 04

Experimental Design and Evaluation Framework

Two core experiments are designed:

Ideal Condition Evaluation：Linear dynamic system + Gaussian noise scenario, verifying whether the classic Kalman update rules can be rediscovered from input-output trajectories, using Mean Squared Error (MSE) to compare with the theoretical optimal solution.
Adversarial and Non-Ideal Condition Evaluation：Testing robustness, including scenarios that are difficult for traditional Kalman filters to handle, such as observation delays, nonlinear systems, and non-Gaussian noise (heavy-tailed/skewed).

Section 05

Key Findings and Results

Near-Optimal Algorithm Recovery：Both pure CGP or LLM-assisted search can rediscover Kalman-like update rules from data, and their performance is close to the theoretical optimal under ideal conditions.
Surpassing Classic Kalman Filters：Under non-ideal conditions, the symbolically evolved programs systematically outperform traditional Kalman filters and can automatically adapt to model mismatches.
Interpretability Advantage：The output is symbolic, human-readable mathematical expressions, allowing engineers to understand and verify the logic, which is superior to black-box neural networks.

Section 06

Practical Application Value and Significance

Contribution to Control Theory: Data-driven methods can automatically explore the algorithm space, discover improvement schemes that human experts may overlook, and break through the traditional design model that relies on mathematical skills and domain knowledge.
Implications for Engineering Practice: Suitable for scenarios where system dynamics are unknown/time-varying, noise is complex and difficult to parameterize, performance and efficiency need to be balanced, and interpretability is required.
Methodological Inspiration: LLMs (such as the 14B-parameter DeepSeek model) can act as "intelligent mutation operators" in evolutionary computing, opening up new directions for neuro-symbolic AI.

Section 07

Limitations and Future Directions

Limitations: High computational cost (each experiment takes about 3 days of GPU time), problem specificity (customized for Kalman filtering), and lack of convergence theory guarantees. Future Directions: Explore more efficient LLM calling strategies, apply to other classic algorithms (such as PID controllers, particle filters), combine neural networks with symbolic programs, and develop a convergence theory framework for LLM-assisted evolution.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

Folkering OS: When the Operating System Itself Is AI—A Self-Evolving Bare-Metal Rust System

Folkering OS is the world's first AI-native bare-metal operating system, entirely written in Rust no_std without relying on Linux, POSIX, or libc. It can generate commands from scratch, compile them into WASM, and run them in 10 seconds, achieving true self-evolution.

Recent activity 2026-04-09 16:15