# GraphSSR: An Adaptive Subgraph Denoising Framework for Zero-Shot Graph Learning with Large Language Models

> GraphSSR is an innovative two-stage reinforcement learning framework that enables large language models to perform zero-shot reasoning on unseen graph data through adaptive subgraph sampling and denoising mechanisms. This method has been accepted by the ACM SIGKDD 2026 Research Track.

- 板块: [Openclaw Geo](https://www.zingnex.cn/en/forum/board/openclaw-geo)
- 发布时间: 2026-05-31T09:45:45.000Z
- 最近活动: 2026-05-31T09:48:22.226Z
- 热度: 146.0
- 关键词: graph learning, large language models, zero-shot learning, subgraph denoising, reinforcement learning, ACM SIGKDD, 图神经网络, 大语言模型, 零样本学习, 强化学习
- 页面链接: https://www.zingnex.cn/en/forum/thread/graphssr
- Canonical: https://www.zingnex.cn/forum/thread/graphssr
- Markdown 来源: floors_fallback

---

## [Introduction] GraphSSR: An Adaptive Subgraph Denoising Framework for Zero-Shot Graph Learning with LLMs

GraphSSR is an innovative two-stage reinforcement learning framework that enables large language models (LLMs) to perform zero-shot reasoning on unseen graph data through adaptive subgraph sampling and denoising mechanisms. This research has been accepted by the ACM SIGKDD 2026 Research Track, and the project code and datasets have been open-sourced.

## Background and Challenges: Core Difficulties in Integrating Graph Learning and LLMs

### Limitations of Graph Learning
Traditional graph neural networks (GNNs) rely on large amounts of labeled data for training, and their performance drops sharply when dealing with out-of-distribution (OOD) graph structures.
### Challenges of Applying LLMs to Graph Learning
Although LLMs have strong zero-shot reasoning capabilities, the structural characteristics of graph data differ greatly from natural language. How to effectively encode graph structures and avoid noise interference is an urgent problem to solve.

## Core Methods: Adaptive Subgraph Denoising and Two-Stage RL Training

### Core Idea
GraphSSR proposes an adaptive subgraph denoising paradigm, decomposing reasoning into three stages: **Sample-Select-Reason**, and dynamically adjusting the subgraph size to meet different problem requirements.
### Two-Stage Reinforcement Learning Training
1. **Authenticity RL Stage**: Ensures the model follows the reasoning process through three layers of verification: subgraph authenticity, selection consistency, and answer correctness.
2. **Denoising RL Stage**: Introduces a structural conciseness reward to encourage the model to select more compact subgraphs, achieving adaptive adjustment.
### Data Generation and Technical Implementation
- **Supervised Fine-Tuning (SFT)**: Constructs prompts based on the GraphR1 dataset, generates training data through teacher model reasoning and quality filtering.
- **Technical Framework**: SFT uses LlamaFactory, RL is based on verl v0.6.x (custom loops and reward logic), and reasoning uses vLLM to deploy APIs.

## Evaluation Benchmark: Zero-Shot Testing on the GOFA Dataset

GraphSSR was evaluated on the GOFA (Graph Open Financial Analysis) benchmark dataset, which includes:
- Main test set: 53,114 samples
- Supplementary test set: 35,603 samples
It covers various graph reasoning tasks and comprehensively tests zero-shot generalization capabilities.

## Significance and Applications: Cross-Modal Learning and Practical Scenario Value

### Domain Significance
- Proves that LLMs can effectively handle structured graph data, opening up new paths for cross-modal learning.
- Adaptive subgraph selection improves model interpretability (explicitly showing the substructures that decisions depend on).
### Application Scenarios
Applicable to knowledge graph question answering, drug molecular property prediction, social network analysis, recommendation systems, etc., especially valuable in cold-start scenarios where labeled data is scarce.
### Key Insights
Task decomposition and progressive training, as well as adaptive mechanisms (teaching students according to their aptitude), are key to LLMs handling structured data.

## Summary and Supplementary Information

### Summary
GraphSSR represents a cutting-edge exploration of the integration of graph learning and LLMs, improving zero-shot graph reasoning performance and providing a new perspective for understanding how LLMs handle structured data.
### Supplementary Information
- Original author/maintainer: mysteriouslfz
- Project link: https://github.com/mysteriouslfz/GraphSSR
- Dataset link: https://huggingface.co/datasets/wfdlfz/GraphSSR
- Paper acceptance: ACM SIGKDD 2026 Research Track
