Section 01
【Original Post】Introduction to the Lightweight Reasoning Model Fine-Tuning Project
This post introduces llama-3-2-3b-reasoning-sft-neo, a project that distills DeepSeek-R1-style chain-of-thought reasoning into the Llama-3.2-3B model through supervised fine-tuning (SFT) with Unsloth and LoRA. The final model is exported in GGUF format (only 2 GB) and can run on 4 GB devices such as mobile phones or a Raspberry Pi, filling a gap in edge-side reasoning models.
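To see why a 3B model can land near the 2 GB mark, here is a rough back-of-the-envelope sketch. It assumes the GGUF export uses a 4-bit quantization, which the post does not state. The parameter count and the bits-per-weight figures are approximate values chosen for illustration, and the names `N_PARAMS`, `BITS_PER_WEIGHT`, and `estimated_size_gb` are hypothetical, not part of the project.

```python
# Rough GGUF size estimate: parameters x bits per weight / 8.
# All constants are assumptions for illustration, not figures from the project.

N_PARAMS = 3.2e9  # approximate parameter count of a 3B-class model (assumed)

# Approximate effective bits per weight, including quantization metadata (assumed).
BITS_PER_WEIGHT = {
    "f16": 16.0,  # unquantized half precision
    "q8": 8.5,    # 8-bit quantization
    "q4": 4.8,    # 4-bit quantization
}


def estimated_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Return the estimated file size in gigabytes (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_WEIGHT.items():
        print(f"{name}: ~{estimated_size_gb(N_PARAMS, bits):.1f} GB")
```

Under these assumptions, the half-precision weights come to roughly 6.4 GB, too large for a 4 GB device, while the 4-bit estimate is a little under 2 GB, which is consistent with the size quoted above. The runtime footprint is larger than the file, since the context cache and activations also need memory.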