Section 01
[Main Floor] Multimodal Large Model OCR Optimization Practice: Synergistic Application of LoRA, GRPO, and ICL
Core Viewpoint: An OCR optimization solution for the Qwen3-VL-4B-based multimodal large model, combining LoRA fine-tuning, GRPO reinforcement learning, and in-context learning (ICL) technologies, achieves performance improvements in downstream OCR tasks across multiple public datasets. The project supports multiple base models, provides a complete training-to-inference workflow, and can serve as a graduation project framework or research foundation.
Original Author and Source
- Original Author/Maintainer: akjncjancj
- Source Platform: GitHub
- Original Title: bishe-sft
- Original Link: https://github.com/akjncjancj/bishe-sft
- Release Time: June 12, 2026