Section 01
Introduction: Reproduction of an LLM-Based Text Anonymization Project (ICLR 2025 Paper)
This article introduces a project that reproduces an ICLR 2025 paper on using large language models (such as GPT-4o) for high-quality text anonymization. The project aims to address the shortcomings of traditional anonymization methods by protecting privacy while preserving the text's practical utility. Key results include a 95% entity recall rate on the Text Anonymization Benchmark (TAB) dataset, which consists of European Court of Human Rights judgments.
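To make the recall figure concrete, here is a minimal sketch of how an entity recall score could be computed. It assumes a simplified span-level definition: a gold (annotated) entity counts as recalled only if it lies entirely inside a masked span. The function names and the example offsets are hypothetical, and the evaluation protocol actually used by the paper or by TAB may differ, for example by grouping several mentions of the same entity.

```python
# Illustrative sketch only: a simplified span-level entity recall.
# All names and the sample offsets below are assumptions for illustration,
# not the project's or the benchmark's actual evaluation code.

Span = tuple[int, int]  # (start, end) character offsets, end exclusive


def is_covered(gold: Span, masked_spans: list[Span]) -> bool:
    """Return True if the gold span lies entirely inside one masked span."""
    g_start, g_end = gold
    return any(m_start <= g_start and g_end <= m_end
               for m_start, m_end in masked_spans)


def entity_recall(gold_spans: list[Span], masked_spans: list[Span]) -> float:
    """Fraction of gold entity spans that were fully masked."""
    if not gold_spans:
        return 1.0  # nothing to protect, so nothing was missed
    covered = sum(is_covered(g, masked_spans) for g in gold_spans)
    return covered / len(gold_spans)


# Four annotated entities; the anonymizer fully masks two of them.
gold = [(0, 10), (25, 31), (40, 52), (60, 64)]
masked = [(0, 10), (24, 32), (40, 48)]
print(entity_recall(gold, masked))  # 0.5
```

Under this definition, a partially masked entity such as `(40, 52)` above counts as a miss, which is the conservative choice for a privacy metric: a leaked fragment of a name can still identify a person.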