Section 01
[Introduction] LLM-HPC-Course: Practical Course on Distributed Training and Inference of Large Models on Supercomputing Platforms
LLM-HPC-Course is an open-source course project developed by HichamAgueny, designed for HPC environments, systematically explaining distributed training and inference of large models on supercomputing clusters. Using PyTorch as the framework and LLaMA model as the core case, the course covers distributed training, model fine-tuning, text summarization, and question-answering tasks, helping researchers and engineers efficiently conduct LLM research.