Section 01
Research Guide to End-to-End Pipeline for Knowledge Graph Construction from Unstructured Documents Using Large Language Models
This master's thesis project explores how to use large language models (LLMs) to extract structured knowledge from unstructured documents and build a complete data pipeline for large-scale knowledge graphs. It aims to address issues in traditional knowledge graph construction such as high manual annotation costs, poor generalization of rule-based systems, and difficulty in maintenance and updates. By integrating LLM capabilities, it achieves end-to-end automated conversion from unstructured documents to structured knowledge bases, with application value in multiple scenarios like enterprise knowledge management and scientific literature analysis.