The study may adopt the following technical approach:
Data Preparation Phase
Collect and organize business process documents, such as flowcharts, operation manuals, and system descriptions. These documents form the input for the LLM analysis.
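As a minimal sketch of this phase, the snippet below gathers documents from a folder into a uniform list of records. It assumes the documents have already been exported to plain-text or Markdown files (flowcharts would need a textual export first); the function name `load_documents` and the record fields `doc_id` and `text` are hypothetical choices for illustration.

```python
from pathlib import Path
from typing import Dict, List


def load_documents(root: Path, suffixes=(".txt", ".md")) -> List[Dict[str, str]]:
    """Collect text documents under `root` into records for later LLM analysis.

    Assumption: every document is a UTF-8 text file with one of `suffixes`;
    files with other extensions are skipped.
    """
    docs = []
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix.lower() in suffixes:
            docs.append({
                # Relative path serves as a stable identifier for the document.
                "doc_id": path.relative_to(root).as_posix(),
                "text": path.read_text(encoding="utf-8"),
            })
    return docs
```

Keeping a stable `doc_id` for each document makes it easier to match LLM output with expert annotations in the evaluation phase.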
Prompt Engineering
Design specialized prompt templates to guide LLMs in identifying GDPR-related tasks. Prompts may include:
- Explanations of key GDPR clauses
- Examples of personal data types to focus on
- Requirements for the output format
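The three prompt components listed above could be assembled along the following lines. This is only a sketch: the clause summaries are short paraphrases chosen for illustration, and the names `GDPR_CLAUSES`, `PERSONAL_DATA_EXAMPLES`, `OUTPUT_FORMAT`, and `build_prompt`, as well as the JSON output shape, are assumptions rather than part of the study design.

```python
import json
from string import Template

# Assumption: illustrative paraphrases; a real study would select and word
# the clauses together with legal experts.
GDPR_CLAUSES = {
    "Art. 6 (lawfulness of processing)":
        "Personal data may only be processed on a lawful basis, such as consent.",
    "Art. 17 (right to erasure)":
        "Data subjects may, in certain circumstances, request erasure of their personal data.",
}

# Assumption: example personal data types the model should watch for.
PERSONAL_DATA_EXAMPLES = ["name", "email address", "health data"]

# Assumption: hypothetical output shape, one object per identified task.
OUTPUT_FORMAT = {
    "task": "<task name>",
    "gdpr_related": True,
    "clauses": ["<article>"],
    "reason": "<short justification>",
}

PROMPT_TEMPLATE = Template(
    "You are reviewing a business process description for GDPR relevance.\n\n"
    "Key GDPR clauses:\n$clauses\n\n"
    "Personal data types to focus on: $data_types\n\n"
    "Return a JSON list with one object per task, shaped like:\n$output_format\n\n"
    "Process description:\n$process_text\n"
)


def build_prompt(process_text: str) -> str:
    """Fill the template with clause explanations, data types, and output format."""
    clauses = "\n".join(
        f"- {name}: {summary}" for name, summary in GDPR_CLAUSES.items()
    )
    return PROMPT_TEMPLATE.substitute(
        clauses=clauses,
        data_types=", ".join(PERSONAL_DATA_EXAMPLES),
        output_format=json.dumps(OUTPUT_FORMAT, indent=2),
        process_text=process_text,
    )
```

Requesting a fixed JSON shape is one way to make the model's answers machine-readable, so they can be compared with expert annotations without manual parsing.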
Model Evaluation and Validation
Compare the tasks identified by the LLMs against manual expert annotations, and calculate metrics such as accuracy and recall. Cross-validation may be used to check the reliability of the results.
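A minimal sketch of the comparison step follows. It assumes each task carries a binary label (GDPR-related or not) keyed by a task identifier, and that a task missing from the LLM output counts as "not GDPR-related"; the function name `evaluate` and the label layout are hypothetical.

```python
from typing import Dict


def evaluate(expert: Dict[str, bool], llm: Dict[str, bool]) -> Dict[str, float]:
    """Compare LLM labels with expert labels, treating expert labels as ground truth.

    Assumption: tasks absent from `llm` are treated as predicted False.
    """
    task_ids = sorted(expert)
    # Tasks where the LLM agrees with the expert, whether positive or negative.
    correct = sum(expert[t] == llm.get(t, False) for t in task_ids)
    # GDPR-related tasks the LLM found (true positives) and missed (false negatives).
    true_pos = sum(expert[t] and llm.get(t, False) for t in task_ids)
    false_neg = sum(expert[t] and not llm.get(t, False) for t in task_ids)

    accuracy = correct / len(task_ids) if task_ids else 0.0
    recall = true_pos / (true_pos + false_neg) if (true_pos + false_neg) else 0.0
    return {"accuracy": accuracy, "recall": recall}


expert_labels = {"t1": True, "t2": True, "t3": False, "t4": False}
llm_labels = {"t1": True, "t2": False, "t3": False, "t4": True}
print(evaluate(expert_labels, llm_labels))  # {'accuracy': 0.5, 'recall': 0.5}
```

In this toy example the LLM agrees with the experts on two of four tasks and finds one of the two GDPR-related tasks, so both metrics are 0.5. Recall is worth reporting separately because a missed GDPR-related task is typically the costlier error in a compliance setting.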