Zing Forum

Reading

LangGraph Multi-Agent Investment Research System: AI Investment Research Practice in the Chemical Industry

A multi-skill agent system built on LangGraph, integrating RAG hybrid retrieval, LightGBM quantitative ranking, and multi-source data fusion, to provide an end-to-end solution for investment research in the chemical industry.

LangGraph投资研究RAGLightGBM智能体化工行业量化分析多源融合
Published 2026-04-10 12:11Recent activity 2026-04-10 12:32Estimated read 6 min
LangGraph Multi-Agent Investment Research System: AI Investment Research Practice in the Chemical Industry
1

Section 01

Introduction: LangGraph Multi-Agent System Empowers AI Investment Research in the Chemical Industry

The multi-skill agent system ai_invest_agent, built on LangGraph, is designed for investment research scenarios in the chemical industry. It integrates RAG hybrid retrieval, LightGBM quantitative ranking, and multi-source data fusion technologies to address pain points such as scattered information and heterogeneous data. It realizes an end-to-end automated process from information collection to investment decision recommendations, reshaping the new paradigm of AI investment research.

2

Section 02

Project Background and Core Architecture

Investment research in the chemical industry faces pain points like scattered information, heterogeneous data, and complex analysis dimensions, which traditional manual methods struggle to fully cover. This project uses LangGraph as the agent orchestration framework (supporting loops, conditional branches, and state management). The system architecture includes three main modules: Data Collection Layer (multi-source data acquisition), Intelligent Analysis Layer (professional agents perform analysis), and Decision Fusion Layer (integrate results to generate recommendations).

3

Section 03

RAG Hybrid Retrieval: Key Technology to Break Information Silos

The system adopts a hybrid retrieval strategy: vector retrieval (finance text-optimized embedding model for semantic matching), keyword retrieval (BM25 for precise matching of key entities), and structured query (SQL-like processing of structured data). The three results are fused via a re-ranking model, balancing recall and precision, and adapting to the feature of dense professional terms in the chemical industry.

4

Section 04

LightGBM Quantitative Ranking: Transformation from Information to Investment Signals

LightGBM is introduced to build a quantitative ranking model. Features cover fundamental aspects (revenue growth rate, ROE, etc.), market sentiment (text sentiment scores), industrial chain (supply-demand relationships), and technical aspects (price trends, etc.). The model is efficient and outputs a comprehensive score. Combined with SHAP values, it provides interpretability to help understand the driving factors of the scores.

5

Section 05

Multi-source Data Fusion: Building a Complete Investment Decision Picture

Fusing multi-channel information: financial report data (parsing PDFs to extract financial indicators), industry reports (extracting trend judgments), supply chain intelligence (monitoring production capacity/orders/inventory), and policies & news (tracking macro impacts). A confidence-weighted mechanism is used to mark information conflicts, ensuring comprehensive and reliable decision-making basis.

6

Section 06

Multi-agent Collaboration: Efficient Decision-making Process with Clear Division of Labor

The system consists of multiple professional agents: data collection, financial report analysis, industry research, risk assessment, quantitative model, and decision fusion agents. Agents collaborate via LangGraph state diagrams, supporting conditional branches (e.g., high risk triggers in-depth due diligence) and iterative loops (expanding retrieval when information is insufficient).

7

Section 07

Practical Value and Future Application Prospects

Practical value: improving efficiency (freeing up researchers), comprehensive coverage (multi-source fusion), strong interpretability (clear logical chain), and continuous learning (optimizing strategies and models). In the future, it can integrate multi-modal data such as satellite images (factory operating rates) and audio (earnings call conferences) to provide more comprehensive intelligence support.