Self-Evolving Scientific Agent: Automatic Discovery of Physically Reasoned Controllers Driven by Large Models

The study proposes a self-evolving scientific agent workflow driven by large language models, which automatically constructs controllers through iterative code generation. In the swimming control task of a two-joint bionic fish, the agent discovers and optimizes interpretable and generalizable control strategies from scratch.

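Tags: scientific agent, large language model, code generation, physical reasoning, controller design, fluid-structure interaction, interpretable AI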
Published 2026-06-07 09:59 | Recent activity 2026-06-09 11:53 | Estimated read 7 min

Section 01

[Introduction] Self-Evolving Scientific Agent: Automatic Discovery of Physically Reasoned Controllers Driven by Large Models

Original Author/Maintainer: Paper Research Team
Source Platform: arXiv
Original Title: Self-Evolving Scientific Agent Discovers Generalizable Physically-Reasoned Fluid Control
Original Link: http://arxiv.org/abs/2606.08405v1
Publication Date: June 7, 2026

Core Viewpoint: The study proposes a self-evolving scientific agent workflow driven by large language models, which automatically constructs controllers through iterative code generation. In the swimming control task of a two-joint bionic fish, the agent discovers and optimizes interpretable and generalizable control strategies from scratch.

Section 02

The Dilemma of Automating Scientific Discovery

Data-intensive deep reinforcement learning can optimize complex control strategies, but scientific discovery for physical systems requires an interpretable chain of reasoning that connects physical evidence to a structured control architecture. Traditional methods learn a strategy by adjusting neural network weights; this is effective, but the resulting controllers are mostly black boxes that are difficult to understand and verify.

Scientific discovery must not only find a feasible solution but also explain why it works. Purely data-driven methods struggle to provide that explanation.

Section 03

Self-Evolving Scientific Agent Workflow

The study proposes a self-evolving scientific agent workflow driven by large language models and implemented via iterative code generation. The core innovation is directly manipulating control strategies at the source code level instead of adjusting weights.

Three stages of the workflow:

  1. Deployment and Observation: Deploy the candidate strategy in a physics simulation and actively diagnose its dynamic behavior, much as a scientist deliberately probes a system's response;
  2. Multimodal Evidence Analysis: Extract physical insights from multimodal data such as motion trajectories, forces, and energy changes, and express them as physical concepts;
  3. Code-Level Strategy Optimization: Generate improved controller code based on these observations, so that the strategy remains fully readable and verifiable.
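
The three stages above can be pictured as a simple loop over controller source code. The sketch below is only an illustration under stated assumptions, not the paper's implementation: `evolve`, `toy_simulate`, `toy_analyze`, and `toy_propose` are hypothetical names, the "controller" is a one-line string, and a deterministic function stands in for the language model that would rewrite the code.

```python
from typing import Callable, Dict, List, Tuple

Evidence = Dict[str, float]

def evolve(seed_source: str,
           simulate: Callable[[str], Evidence],
           analyze: Callable[[Evidence], Tuple[float, str]],
           propose: Callable[[str, str], str],
           rounds: int = 5) -> Tuple[str, float, List[str]]:
    """Run the observe -> analyze -> rewrite loop and keep the best source."""
    best_source = seed_source
    best_score, diagnosis = analyze(simulate(seed_source))
    log = [f"seed: score={best_score:.3f}, diagnosis={diagnosis}"]
    for i in range(rounds):
        # Stage 3: rewrite the controller source using the current diagnosis.
        candidate = propose(best_source, diagnosis)
        # Stages 1 and 2: deploy the candidate, then turn evidence into a score
        # and a textual diagnosis.
        score, cand_diag = analyze(simulate(candidate))
        log.append(f"round {i + 1}: score={score:.3f}, diagnosis={cand_diag}")
        if score > best_score:
            best_source, best_score, diagnosis = candidate, score, cand_diag
    return best_source, best_score, log

# --- Toy stand-ins (assumptions for illustration only) ---

def toy_simulate(source: str) -> Evidence:
    # The "controller" is a single line 'bias = <float>'; the observed
    # heading error is taken to equal that bias.
    bias = float(source.split("=")[1])
    return {"heading_error": bias}

def toy_analyze(evidence: Evidence) -> Tuple[float, str]:
    err = evidence["heading_error"]
    side = "left" if err > 0 else "right" if err < 0 else "none"
    return -abs(err), f"steering bias to the {side}"

def toy_propose(source: str, diagnosis: str) -> str:
    # Placeholder for a language-model rewrite: halve the bias.
    bias = float(source.split("=")[1])
    return f"bias = {bias / 2}"
```

Calling `evolve("bias = 0.8", toy_simulate, toy_analyze, toy_propose)` returns the best source found, its score, and a human-readable log. The log plays the role of the traceable record that the workflow keeps: every accepted change is tied to a diagnosis rather than to an opaque weight update.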

Section 04

Validation Task: Swimming Control of a Two-Joint Bionic Fish

The validation scenario is an underactuated two-joint bionic fish (dogfish swimmer) that must reach a target position using only control of its joint angular accelerations, which makes it a nonlinear fluid-structure interaction problem.

Initial Condition: The agent starts from a defective seed strategy with a one-sided steering bias and must independently discover a single unified controller that reaches targets in all directions.

Generalization Ability:

  • Generalizes to unseen static targets without retraining or target-specific branches;
  • Handles dynamic, curved pursuit trajectories and adapts to complex target movements.

This generalization stems from physical reasoning rather than memorization or interpolation.
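
As a rough picture of what such an evaluation set could look like, the sketch below generates static targets spread evenly around the swimmer and a target moving along a circular arc. The function names, radii, and speeds are assumptions chosen for illustration; the source does not specify the actual test targets.

```python
import math
from typing import List, Tuple

def ring_targets(n: int, radius: float) -> List[Tuple[float, float]]:
    """Return n static targets evenly spaced on a circle around the origin."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n)) for k in range(n)]

def moving_target(t: float, radius: float = 2.0,
                  angular_speed: float = 0.2) -> Tuple[float, float]:
    """Return the position at time t of a target moving on a circular path."""
    angle = angular_speed * t
    return (radius * math.cos(angle), radius * math.sin(angle))
```

Evaluating one unchanged controller on `ring_targets(8, 2.0)` and on `moving_target(t)` sampled over time would mirror the two generalization checks listed above.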

Section 05

Interpretable Control Architecture

An audit of the evolution log shows that the agent independently discovered the following components of the control architecture:

  • Traveling Wave Propulsion: Use body undulation to generate propulsive force;
  • Body Coordinate Target Guidance: Calculate the target direction in the fish's body coordinate system;
  • Yaw Rate Feedback: Adjust actions based on steering rate;
  • Signed Average Tail Curvature: Use tail shape information;
  • Adaptive Rhythm Mitigation: Dynamically adjust movement rhythm.

These components exist in the code in clear mathematical forms and are fully auditable and verifiable.
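
To make "clear mathematical forms" concrete, here is a minimal sketch of how three of the listed components (traveling wave propulsion, body coordinate target guidance, and yaw rate feedback) might be combined into joint angular acceleration commands. Every formula, gain, and name here is an assumption made for illustration; the discovered controller's actual equations are not given in this summary, and the tail curvature and adaptive rhythm terms are left out.

```python
import math
from typing import Sequence, Tuple

def body_frame_bearing(pos: Tuple[float, float], heading: float,
                       target: Tuple[float, float]) -> float:
    """Angle to the target in the fish's body frame (positive = counterclockwise)."""
    dx, dy = target[0] - pos[0], target[1] - pos[1]
    # Rotate the world-frame offset by -heading to express it in the body frame.
    bx = math.cos(heading) * dx + math.sin(heading) * dy
    by = -math.sin(heading) * dx + math.cos(heading) * dy
    return math.atan2(by, bx)

def joint_accelerations(t: float, q: Sequence[float], qd: Sequence[float],
                        pos: Tuple[float, float], heading: float,
                        yaw_rate: float, target: Tuple[float, float],
                        amp: float = 0.4, omega: float = 2 * math.pi,
                        lag: float = math.pi / 2, k_bearing: float = 0.3,
                        k_yaw: float = 0.1, kp: float = 50.0,
                        kd: float = 10.0) -> Tuple[float, float]:
    """Return hypothetical angular acceleration commands for the two joints."""
    bearing = body_frame_bearing(pos, heading, target)
    # Steering offset: turn toward the target, damped by the current yaw rate.
    # Assumes a positive joint offset turns the body counterclockwise.
    offset = k_bearing * bearing - k_yaw * yaw_rate
    commands = []
    for i in range(2):
        # Traveling wave: the second joint lags the first by a fixed phase.
        phase = omega * t - i * lag
        q_des = amp * math.sin(phase) + offset
        qd_des = amp * omega * math.cos(phase)
        # PD tracking of the desired joint angle and rate.
        commands.append(kp * (q_des - q[i]) + kd * (qd_des - qd[i]))
    return commands[0], commands[1]
```

Because each term is an explicit expression, a reader can check its sign and magnitude directly. For example, a target to the fish's left yields a positive bearing and shifts both joint commands by `kp * k_bearing * bearing`.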

Section 06

Research Significance and Implications

The study demonstrates the ability of autonomous scientific agents to transform physical evidence into robust, mathematically readable control strategies while maintaining a traceable scientific discovery process.

Significance:

  • Scientific Automation: A paradigm shift from 'black-box optimization' to 'white-box reasoning'. In the future, agents could assist or even lead scientific discovery while keeping results interpretable and verifiable;
  • Robotics and Control Theory: Offers a new path that uses the reasoning ability of large models to generate control strategies grounded in physical intuition rather than pure data fitting.