Section 01
PII Data Desensitization: Introduction to the Dual-Model Collaborative Protection Scheme
Core point: This article explores a PII data desensitization scheme integrating fine-tuning of BERT/RoBERTa encoders and prompt engineering of Large Language Models (LLMs). Through dual-model collaboration (encoder for precise positioning + LLM for semantic verification), it achieves efficient identification and automatic masking of sensitive information like names and emails, addressing the limitations of traditional rule/regex methods in complex scenarios and providing a feasible path for privacy protection in AI applications.