Section 01
SharedRequest: Guide to Model-Agnostic Privacy-Preserving Inference Scheme for LLMs
SharedRequest proposes an innovative privacy-preserving inference framework that identifies and filters sensitive information from user queries before they enter large language models (LLMs) using a model-agnostic discrimination method, balancing data privacy and model utility. Addressing the pain points of existing privacy-preserving technologies (high overhead of homomorphic encryption, quality degradation from differential privacy), this scheme adopts a layered processing architecture, fully decoupled from the underlying LLM, and can flexibly adapt to various language models, providing an efficient and secure solution for AI applications in sensitive scenarios.