Section 01
Panoramic Review of Pathology Visual Language Models: Technological Evolution from Contrastive Learning to Agent Systems (Introduction)
This article organizes the curated resource repository Awesome-Pathology-VLMs in the field of Pathology Visual Language Models (Pathology VLMs). The repository is divided into five categories based on technical routes: contrastive learning/dual encoder, generative/instruction fine-tuning, reasoning enhancement/RL, agent systems, and VLM-enhanced MIL, reflecting the evolution of pathology AI from image-text alignment to complex reasoning and autonomous decision-making. Pathology VLMs aim to solve the time-consuming and labor-intensive problem of manual review of Whole Slide Images (WSI), enabling automated analysis and report generation through cross-modal understanding.