Section 01
Introduction / Main Floor: Large Model Privacy Protection Dataset: Open Resource for PII Detection and Prompt Enhancement
This is a privacy-aware prompt enhancement dataset designed specifically for LLM applications, containing 10,000 annotated samples, 75% of which are synthetically generated. It supports PII identification, classification, and anonymization, providing a training and evaluation benchmark for building privacy-preserving AI systems.