Section 01
UniEdit: Guide to the Unified Evaluation Benchmark for Knowledge Editing in Large Language Models
UniEdit is a unified knowledge editing evaluation benchmark for large language models, featuring 311,000 samples and covering 25 knowledge domains. It systematically evaluates knowledge editing algorithms from three dimensions: reliability, generalization, and locality. It addresses the limitations of existing benchmarks, such as narrow coverage, single structure, and incomplete evaluation criteria, providing a standardized evaluation tool for the field and promoting the development of knowledge editing technology.