Section 01
Green Shielding: Building a User-Centric New Framework for Trustworthy AI Evaluation (Introduction)
The research team proposes the Green Shielding method, which uses the CUE standard to evaluate large models' sensitivity to daily input changes. In the field of medical diagnosis, it was found that prompt-level factors systematically affect the clinically relevant attributes of model outputs. This framework emphasizes shifting from adversarial testing to user-centric evaluation, focusing on the impact of real users' diverse expression styles on model behavior, and providing evidence-based guidance for AI deployment.