Zing Forum

Reading

SQUALL: A Multimodal Foundation Model Integrating Histopathology and Spatial Molecular Data

This article introduces SQUALL, a 555-million-parameter multimodal foundation model that integrates histopathological images and spatial transcriptomics data to predict molecular expression from pathological images, providing a new tool for cancer research and clinical diagnosis.

多模态模型空间转录组计算病理学癌症研究基础模型生物标志物精准医疗深度学习
Published 2026-06-01 19:01Recent activity 2026-06-01 19:21Estimated read 5 min
SQUALL: A Multimodal Foundation Model Integrating Histopathology and Spatial Molecular Data
1

Section 01

Introduction to the SQUALL Multimodal Foundation Model

SQUALL is a 555-million-parameter multimodal foundation model that integrates histopathological images and spatial transcriptomics data to predict molecular expression from pathological images, providing a new tool for cancer research and clinical diagnosis. Developed by the OswaldZhang team, its open-source code repository was released on GitHub on June 1, 2026.

2

Section 02

Research Background and Challenges

In the fields of cancer research and precision medicine, histopathology and spatial transcriptomics are core diagnostic technologies, but they have long been in silos. Traditional methods process image and genetic data separately, lacking a unified model, which limits the ability to infer molecular features and predict prognosis from pathological images.

3

Section 03

Model Architecture and Technical Innovations

SQUALL adopts a phased self-supervised training strategy: 1. Image feature learning; 2. Cross-modal alignment; 3. Task fine-tuning. Based on the histMol corpus (33 tissue types, 12 platforms, 1.76 billion spatial spots), its core capabilities include large-scale virtual biomarker analysis (covering 15,757 genes) and identification of prognosis-related spatial microenvironments (e.g., TLS maturation patterns).

4

Section 04

Clinical Application Validation Results

In breast cancer research, it identified molecular trajectories related to tumor invasion; in ovarian cancer, it recognized recurrence-related immune-excluded microenvironments; its performance in prognosis prediction tasks outperforms traditional models, and it shows balanced performance in predicting platinum chemotherapy resistance, providing support for clinical decision-making.

5

Section 05

Technical Implementation and Open-Source Resources

The open-source code repository includes modules for preprocessing, pre-training, fine-tuning, and inference, based on the PyTorch framework. It provides tutorials and examples, supporting the generation of virtual gene expression profiles from pathological slides and spatial microenvironment clustering analysis.

6

Section 06

Research Significance and Industry Impact

It breaks through the boundaries of multimodal spatial representation learning, promotes low-cost molecular typing, retrospective cohort studies, real-time diagnostic assistance, and drug response prediction, lowering the technical threshold for precision medicine.

7

Section 07

Limitations and Future Directions

Current limitations include insufficient coverage of training data (bias towards rare cancers) and limitations in spatial transcriptomics resolution; future plans include integrating higher-resolution data, expanding cancer types, and combining clinical information to build a comprehensive model.

8

Section 08

Conclusion

SQUALL integrates histopathology and spatial molecular data, advances the boundaries of computational pathology technology, provides a practical tool for precision medicine and cancer research, and is expected to play an important role in clinical practice in the future.