Section 01
[Introduction] Systematic Review and Resource Compilation of Multimodal Large Language Models in Low-Level Vision
This GitHub resource compilation comprehensively sorts out the applications of multimodal large language models in low-level vision tasks, covering core technical directions such as visual encoder adaptation, language branch optimization, output head design, and parameter-efficient fine-tuning. It also organizes cutting-edge progress in extended application fields like medical image processing and remote sensing data handling, providing valuable references for researchers and developers.