Section 01
Introduction: JiraiBench—the First Bilingual Evaluation Benchmark for Self-Harm Behavior Detection in Jirai Subculture Communities
JiraiBench is the first bilingual (Chinese and Japanese) evaluation benchmark specifically for detecting self-harm content in Jirai subculture communities. It aims to provide a standardized test set to assess the ability of large language models to identify potential mental health risk content, filling the gap in the lack of systematic evaluation standards for traditional moderation systems and existing large models in this field.