# LLM-Enhanced Semantic Networks: A New Framework for Improving Cross-Sectional Stock Return Prediction

> This paper proposes a two-stage framework that uses large language models (LLMs) to filter spurious edges in financial networks constructed based on text similarity, enhancing the economic authenticity of the network and significantly improving the performance of pairs trading strategies.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-04-21T13:59:30.000Z
- 最近活动: 2026-04-22T04:15:14.864Z
- 热度: 132.7
- 关键词: 金融网络, 横截面收益预测, 大语言模型, 文本挖掘, 配对交易, 网络过滤
- 页面链接: https://www.zingnex.cn/en/forum/thread/llm-a3674aba
- Canonical: https://www.zingnex.cn/forum/thread/llm-a3674aba
- Markdown 来源: floors_fallback

---

## [Introduction] LLM-Enhanced Semantic Networks: A New Framework for Improving Stock Return Prediction

This paper proposes a two-stage framework that uses large language models (LLMs) to filter spurious edges in financial networks constructed based on text similarity, enhancing the economic authenticity of the network and significantly improving the risk-adjusted return performance of pairs trading strategies.

## Background: Potential and Existing Pitfalls of Text-Based Financial Networks

In the field of quantitative investment, text-based financial networks can capture cross-sectional correlations between companies and support pairs trading strategies (betting on regression to equilibrium when prices deviate). However, in practice, networks constructed from text similarity often have spurious connections (e.g., accidental co-occurrence of common words or popular concepts), which contaminate the network structure and lead to systematic biases in strategies.

## Methodology: Two-Stage Framework and Signal Aggregation Strategy

**Two-Stage Framework**: 1. Candidate Graph Construction: Extract text from 10-K filings of U.S. listed companies, generate semantic embeddings, and build a sparse candidate graph using a high similarity threshold; 2. LLM-Enhanced Edge Filtering: Use prompt engineering to guide LLMs to judge the real economic relationships (competition, supply chain, etc.) of candidate connections.

**Signal Aggregation**: Relationship awareness (weighting different association types) + distance weighting (network distance decay), converted into trading decisions (signal strength is positively correlated with position size).

## Evidence: Significant Improvement in Backtest Performance

Backtests on S&P 500 components from 2011 to 2019 show: The Sharpe ratio of the long-short portfolio after LLM filtering increased from 0.742 to 0.820 (+10%+), and the maximum drawdown narrowed from -10.47% to -7.85%. Compared with traditional filtering methods (industry matching, keyword matching, etc.), LLMs perform better, confirming their unique advantage in understanding business semantics.

## Methodological Implications: LLMs as Network Quality Enhancers

This study demonstrates the value of LLMs in network structure optimization (judging relationships, filtering noise), and this paradigm can be applied to supply chain networks, knowledge graphs, social networks, and other fields. As an intermediate layer tool, LLMs improve input quality and are more robust than end-to-end predictions, opening up new directions for financial AI.

## Limitations and Improvement Opportunities

1. High computational cost: Significant costs and delays for LLM judgments in large-scale networks; 2. Static network: Regular updates struggle to capture dynamic relationships in a timely manner; 3. LLM bias: Possible misjudgments for emerging/niche industries. Improvement directions: Incremental update mechanism, real-time filtering, bias correction.

## Conclusion: Towards More Intelligent Financial Network Analysis

This study combines LLMs with quantitative methods to enhance the authenticity of financial networks and improve strategy performance. In the future, LLM-enhanced frameworks are expected to expand to scenarios such as bond pricing and commodity analysis, feeding back into network science research and unlocking greater value.
