Section 01
【Introduction】Core Summary of the Comparative Study Between Traditional NLP and LLM in Privacy Policy Classification
The core research topic of this article is to compare the performance of traditional NLP machine learning models (e.g., TF-IDF + SVM) and large language models (LLM) in the multi-label classification task of privacy policies. Using the classic OPP-115 dataset, the study focuses on the performance differences between models in scenarios with class imbalance, and finally reveals the significant advantages of traditional methods in this task. The study aims to answer: In privacy policy classification, which is better—traditional methods or LLM? This question involves multiple considerations such as technology selection, resource efficiency, interpretability, and deployment costs.