Section 01
[Introduction] AFSPL Adaptive Federated Soft Prompt Learning Framework: A New Paradigm for Privacy-Preserving Multimodal AI Training
This article introduces the AFSPL (Adaptive Federated Soft Prompt Learning) framework, which integrates federated learning, soft prompt learning, and multimodal models (CLIP visual encoder + Flan-T5 text decoder) to achieve efficient fine-tuning of large-scale multimodal models while protecting data privacy. Its core innovation lies in the adaptive soft prompt mechanism combined with the Flower federated learning framework, solving the problems of scattered data in sensitive fields and high fine-tuning costs of large models, and providing a new paradigm for privacy-preserving multimodal AI training.