Section 01
Q-Scorer Project Overview: Score Token + Decoder Paradigm to Optimize MLLM Scoring Capabilities
Q-Scorer is a research project optimized for the scoring tasks of multi-modal large language models (MLLMs). It proposes an innovative "Score Token + Decoder" paradigm to address the shortcomings of current MLLMs in scoring tasks. This paradigm reframes the scoring task as a generation problem, applicable to various scenarios such as image quality assessment, video content scoring, and multi-modal alignment evaluation, providing new ideas for enhancing MLLM's scoring capabilities.