
Self-Evolving Scientific Agent: Automatic Discovery of Physically Reasoned Controllers Driven by Large Models

The study proposes a self-evolving scientific agent workflow driven by large language models, which automatically constructs controllers through iterative code generation. In the swimming control task of a two-joint bionic fish, the agent discovers and optimizes interpretable and generalizable control strategies from scratch.

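Scientific agents, large language models, code generation, physical reasoning, controller design, fluid-structure interaction, interpretable AI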

Published 2026-06-07 09:59 | Recent activity 2026-06-09 11:53 | Estimated read 7 min

Section 01

[Introduction] Self-Evolving Scientific Agent: Automatic Discovery of Physically Reasoned Controllers Driven by Large Models

Original Author/Maintainer: Paper Research Team
Source Platform: arXiv
Original Title: Self-Evolving Scientific Agent Discovers Generalizable Physically-Reasoned Fluid Control
Original Link: http://arxiv.org/abs/2606.08405v1
Publication Date: June 7, 2026


Section 02

The Dilemma of Automating Scientific Discovery

Data-intensive deep reinforcement learning can optimize complex control policies, but scientific discovery for physical systems requires an interpretable chain of reasoning that connects physical evidence to a structured control architecture. Traditional methods learn a policy by adjusting neural network weights; the resulting controllers are effective but mostly black boxes, which makes them difficult to understand and verify.

Scientific discovery requires more than finding a feasible solution; it also requires understanding why the solution works. Purely data-driven methods struggle to provide that understanding.

Section 03

Self-Evolving Scientific Agent Workflow

The study proposes a self-evolving scientific agent workflow driven by large language models and implemented via iterative code generation. The core innovation is directly manipulating control strategies at the source code level instead of adjusting weights.

Three stages of the workflow:

Deployment and Observation: Deploy the candidate strategy in the physical simulation and actively diagnose its dynamic behavior, much as a scientist deliberately probes a system's response;
Multimodal Evidence Analysis: Extract physical insights from multimodal data such as motion trajectories, forces, and energy changes, and transform them into understanding at the physical concept level;
Code-Level Strategy Optimization: Generate improved controller code based on observations, making the strategy fully readable and verifiable.
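
The three stages above can be sketched as a loop over controller source code. This is a minimal illustration, not the paper's implementation: the 1-D `toy_simulate` stands in for the physics simulation, `stub_propose` is a deterministic placeholder for the large-model call, and every name here (`deploy_and_observe`, `analyze_evidence`, `evolve`, `GAIN`) is hypothetical.

```python
import re

# Seed controller kept as plain source text, so every revision stays readable.
SEED_SOURCE = """
def controller(error):
    GAIN = 0.1
    return GAIN * error
"""

def toy_simulate(controller, steps=20, target=1.0):
    """Stand-in for the physics simulation: a 1-D point moved by the control output."""
    x = 0.0
    trajectory = [x]
    for _ in range(steps):
        x += controller(target - x)
        trajectory.append(x)
    return {"final_error": abs(target - x), "trajectory": trajectory}

def deploy_and_observe(source, simulate):
    """Stage 1: load the candidate source and run it in the simulator."""
    namespace = {}
    exec(source, namespace)  # the candidate must define `controller`
    return simulate(namespace["controller"])

def analyze_evidence(rollout):
    """Stage 2: turn raw rollout numbers into short diagnostic statements."""
    diagnostics = []
    if rollout["final_error"] > 0.01:
        diagnostics.append("converges too slowly: final error %.4f" % rollout["final_error"])
    return diagnostics

def stub_propose(source, diagnostics):
    """Stage 3 placeholder (assumption): a real system would call a language model here.
    This stub doubles GAIN whenever slow convergence was diagnosed."""
    if any("slowly" in d for d in diagnostics):
        return re.sub(r"GAIN = ([0-9.]+)",
                      lambda m: "GAIN = %s" % (float(m.group(1)) * 2), source)
    return source

def evolve(seed_source, simulate, propose, n_iters=5):
    """Iterate observe -> analyze -> rewrite, keeping a log of every step."""
    best_source = seed_source
    best_rollout = deploy_and_observe(best_source, simulate)
    log = []
    for i in range(n_iters):
        diagnostics = analyze_evidence(best_rollout)
        log.append((i, best_rollout["final_error"], diagnostics))
        if not diagnostics:
            break  # nothing left to fix
        candidate = propose(best_source, diagnostics)
        rollout = deploy_and_observe(candidate, simulate)
        if rollout["final_error"] < best_rollout["final_error"]:
            best_source, best_rollout = candidate, rollout
    return best_source, best_rollout, log

best_source, best_rollout, log = evolve(SEED_SOURCE, toy_simulate, stub_propose)
```

Because the candidate is stored as text, the log of diagnostics and the final `best_source` can be read side by side, which is the property the workflow relies on for auditability.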

Section 04

Validation Task: Swimming Control of a Two-Joint Bionic Fish

The validation scenario is an underactuated two-joint bionic fish (dogfish swimmer) that must reach a target position using only control of its joint angular accelerations, which makes it a nonlinear fluid-structure interaction problem.
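
Acceleration-only control means the controller cannot set a joint angle directly; the angle follows from integrating the commanded acceleration twice. The sketch below illustrates this for a single joint, assuming a simple PD law and semi-implicit Euler integration. The gains, time step, and function name are illustrative assumptions, and no fluid forces are modelled.

```python
def track_joint_angle(theta_ref, kp=100.0, kd=20.0, dt=0.01, steps=300):
    """Drive one joint toward theta_ref using only angular acceleration commands."""
    theta, omega = 0.0, 0.0  # joint angle (rad) and angular velocity (rad/s)
    for _ in range(steps):
        # Commanded angular acceleration: the only quantity the controller sets.
        alpha = kp * (theta_ref - theta) - kd * omega
        omega += alpha * dt  # integrate acceleration into velocity
        theta += omega * dt  # integrate velocity into angle
    return theta

final_angle = track_joint_angle(0.5)
```

In the swimming task both joints would be driven this way while the surrounding fluid pushes back on the body, which is what makes the real problem nonlinear.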

Initial Condition: The agent starts from a defective seed strategy with a one-sided steering bias and must independently discover a single unified controller that reaches targets in all directions.

Generalization Ability:

Generalizes to unseen static targets without retraining or target-specific branches;
Handles dynamic curved pursuit trajectories and adapts to complex movements.

The generalization rests on physical reasoning rather than on memorization or interpolation.
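
To illustrate what reaching targets in every direction without target-specific branches can look like, the toy check below applies one steering law, expressed in body coordinates, to targets placed all around the start point. It uses a kinematic point model rather than the paper's fluid simulation, and the function name, gains, and tolerances are assumptions made for the example.

```python
import math

def reaches_target(target_xy, speed=0.05, k_turn=2.0, dt=0.1, steps=2000, tol=0.1):
    """Toy point swimmer: constant forward speed, turn rate proportional to target bearing."""
    x, y, heading = 0.0, 0.0, 0.0
    for _ in range(steps):
        dx, dy = target_xy[0] - x, target_xy[1] - y
        if math.hypot(dx, dy) < tol:
            return True
        # Bearing to the target in the body frame, wrapped to [-pi, pi].
        diff = math.atan2(dy, dx) - heading
        bearing = math.atan2(math.sin(diff), math.cos(diff))
        heading += k_turn * bearing * dt  # same law for left and right turns
        x += speed * math.cos(heading) * dt
        y += speed * math.sin(heading) * dt
    return False

# Twelve targets evenly spaced on the unit circle, including one directly behind.
targets = [(math.cos(2 * math.pi * k / 12), math.sin(2 * math.pi * k / 12))
           for k in range(12)]
results = [reaches_target(t) for t in targets]
```

The steering law contains no case distinction on where the target lies; the symmetry comes from computing the bearing relative to the body rather than to the world frame.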

Section 05

Interpretable Control Architecture

An audit of the evolution log shows that the agent independently discovered the following components of the control architecture:

Traveling Wave Propulsion: Use body undulation to generate propulsive force;
Body Coordinate Target Guidance: Calculate the target direction in the fish's body coordinate system;
Yaw Rate Feedback: Adjust the steering action based on the measured yaw rate;
Signed Average Tail Curvature: Use tail shape information;
Adaptive Rhythm Mitigation: Dynamically adjust movement rhythm.

These components appear in the code as explicit mathematical expressions, so they can be audited and verified directly.
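
The article names the components but does not give their formulas. As a rough sketch of how the first three might fit together, the code below combines a body-frame target bearing, yaw rate feedback, and a two-joint traveling wave. All formulas, gains, sign conventions (counterclockwise positive), and function names are assumptions for illustration, not the expressions the agent actually produced.

```python
import math

def body_frame_bearing(fish_xy, heading, target_xy):
    """Angle to the target in the fish's body frame, wrapped to [-pi, pi]."""
    dx = target_xy[0] - fish_xy[0]
    dy = target_xy[1] - fish_xy[1]
    diff = math.atan2(dy, dx) - heading
    return math.atan2(math.sin(diff), math.cos(diff))

def steering_offset(bearing, yaw_rate, k_bearing=0.5, k_yaw=0.1, limit=0.4):
    """Steer toward the target, damped by the current yaw rate, then clipped."""
    raw = k_bearing * bearing - k_yaw * yaw_rate
    return max(-limit, min(limit, raw))

def joint_references(t, offset, amplitude=0.3, freq_hz=1.0, phase_lag=math.pi / 2):
    """Traveling wave: the second joint lags the first; a shared offset bends the body to steer."""
    w = 2 * math.pi * freq_hz
    q1 = offset + amplitude * math.sin(w * t)
    q2 = offset + amplitude * math.sin(w * t - phase_lag)
    return q1, q2

# Example: fish at the origin heading along +x, target up and to the left.
bearing = body_frame_bearing((0.0, 0.0), 0.0, (0.0, 1.0))
offset = steering_offset(bearing, yaw_rate=0.0)
q1, q2 = joint_references(0.0, offset)
```

The signed average tail curvature and adaptive rhythm terms are left out here because the article gives too little detail to sketch them without guessing.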

Section 06

Research Significance and Implications

The study demonstrates the ability of autonomous scientific agents to transform physical evidence into robust, mathematically readable control strategies while maintaining a traceable scientific discovery process.

Significance:

Scientific Automation: A paradigm shift from 'black-box optimization' to 'white-box reasoning'. In the future, agents could assist or even lead scientific discovery while keeping results interpretable and verifiable;
Robotics and Control Theory: Provides a new path that uses the reasoning ability of large models to generate control strategies grounded in physical intuition, rather than relying on pure data fitting.
