Section 01
Introduction / Main Post: Multimodal Image Retrieval: Comparative Study and Optimization of CLIP and BLIP on Flickr30K
This multimodal retrieval project is built on the Flickr30K dataset. It compares the training of the CLIP and BLIP models, implements image retrieval and caption generation, and improves model performance through fine-tuning strategies.