Section 01
Introduction: A Systematic Empirical Study on the Impact of Sampling Temperature on Hallucinations in RAG Systems
This study focuses on the impact of the sampling temperature parameter on hallucinations in large language models (LLMs) within Retrieval-Augmented Generation (RAG) systems. By constructing a complete experimental framework and conducting empirical analysis using the Meta-Llama-3.1-8B-Instruct model, it aims to provide data support for understanding the factual reliability of LLMs and optimizing model configurations in production environments. The research covers data preparation, RAG pipeline, evaluation scripts, statistical analysis, and other links, emphasizing reproducibility and a pragmatic orientation.