Section 01
[Overview] Research on Privacy Risk Testing of Multimodal Large Language Models: Practice with PRISM, MultiPriv, and AP² Frameworks
A study by Beijing Institute of Technology systematically evaluated the privacy inference risks of Multimodal Large Language Models (MLLMs) using three benchmark frameworks: PRISM, MultiPriv, and AP². The research revealed security risks where MLLMs might infer users' privacy attributes through text, image, and audio clues, and proposed an evidence-based enhancement method to improve the rigor of evaluation. This article will cover the research background, methodology, experiments, findings, and significance in separate floors.