Section 01
[Main Post] Introduction to Deep Understanding of the Mathematical Foundations of Large Language Models: From Gradients to Hallucinations
Behind the amazing capabilities of Large Language Models (LLMs) lies a sophisticated set of mathematical frameworks. This article will delve into the mathematical principles from gradient optimization to the formation mechanism of hallucination phenomena, helping readers establish a systematic understanding of the working mechanism of LLMs.