Section 01
ChatSR: The First Multimodal LLM for Symbolic Regression (Core Overview)
ChatSR: A Scientific Multimodal Large Language Model for Discovering Formulas from Scientific Data
ChatSR is the first multimodal large language model in the symbolic regression field. It encodes scientific data with a Set Transformer, generates the preorder traversal of a mathematical expression that describes the pattern in the data, supports BFGS optimization of the expression's constant terms, and reports the coefficient of determination R² as the goodness of fit.
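The last three steps (preorder output, BFGS constant fitting, and R² scoring) can be illustrated with a minimal sketch. This is not ChatSR's actual code: the token names (`add`, `mul`, `sin`, `x`, `C`), the helper functions, and the synthetic data are all assumptions made for illustration, and the preorder sequence is written by hand rather than produced by a model. Only `numpy` and `scipy.optimize.minimize` are real library calls.

```python
import numpy as np
from scipy.optimize import minimize

def evaluate(tokens, x, consts):
    """Evaluate a preorder token list at x.

    Hypothetical vocabulary: 'add' and 'mul' are binary, 'sin' is unary,
    'x' is the input variable, and each 'C' is a constant placeholder
    that consumes the next value from consts.
    """
    pos = 0    # index of the next token to read
    cidx = 0   # index of the next constant to use

    def walk():
        nonlocal pos, cidx
        tok = tokens[pos]
        pos += 1
        if tok == "x":
            return x
        if tok == "C":
            value = consts[cidx]
            cidx += 1
            return value
        if tok == "sin":
            return np.sin(walk())
        left = walk()
        right = walk()
        if tok == "add":
            return left + right
        if tok == "mul":
            return left * right
        raise ValueError(f"unknown token: {tok}")

    return walk()

def fit_constants(tokens, x, y):
    """Fit the 'C' placeholders by minimizing mean squared error with BFGS."""
    n_consts = tokens.count("C")

    def loss(c):
        return np.mean((evaluate(tokens, x, c) - y) ** 2)

    result = minimize(loss, x0=np.ones(n_consts), method="BFGS")
    return result.x

def r_squared(y, y_hat):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot

# Synthetic data from y = 2.5 * sin(x) + 0.7 (an assumed target, for illustration).
x = np.linspace(-3.0, 3.0, 50)
y = 2.5 * np.sin(x) + 0.7

# Preorder traversal of (C * sin(x)) + C, written by hand here.
tokens = ["add", "mul", "C", "sin", "x", "C"]

consts = fit_constants(tokens, x, y)
r2 = r_squared(y, evaluate(tokens, x, consts))
print("fitted constants:", consts)
print("R^2:", r2)
```

Preorder notation needs no parentheses because each operator's arity fixes how many subexpressions follow it, which is one reason it suits token-by-token generation. Leaving constants as `C` placeholders keeps the vocabulary small and hands the numeric values to a continuous optimizer.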
The project applies multimodal large language models to symbolic regression, using their sequence-generation ability to output structured representations of mathematical expressions directly. This offers a new AI-driven tool for scientific discovery.