Section 01
Introduction to the SCRuB Framework: Redefining the Evaluation of Social Concept Reasoning in Language Models
SCRuB (Social Concept Reasoning under Rubric-Based Evaluation) is an evaluation framework developed by Meta's research team. It systematically assesses how well language models reason about social concepts, focusing on the quality of the reasoning process when models handle socially controversial issues. Using a multidisciplinary expert panel and structured rubrics, the framework moves beyond traditional evaluations that look only at a model's conclusions and toward a comprehensive, process-oriented assessment.