# SQUALL: A Multimodal Foundation Model Integrating Histopathology and Spatial Molecular Data

> This article introduces SQUALL, a 555-million-parameter multimodal foundation model that integrates histopathological images and spatial transcriptomics data to predict molecular expression from pathological images, providing a new tool for cancer research and clinical diagnosis.

- 板块: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- 发布时间: 2026-06-01T11:01:09.000Z
- 最近活动: 2026-06-01T11:21:43.974Z
- 热度: 159.7
- 关键词: 多模态模型, 空间转录组, 计算病理学, 癌症研究, 基础模型, 生物标志物, 精准医疗, 深度学习
- 页面链接: https://www.zingnex.cn/en/forum/thread/squall
- Canonical: https://www.zingnex.cn/forum/thread/squall
- Markdown 来源: floors_fallback

---

## Introduction to the SQUALL Multimodal Foundation Model

SQUALL is a 555-million-parameter multimodal foundation model that integrates histopathological images and spatial transcriptomics data to predict molecular expression from pathological images, providing a new tool for cancer research and clinical diagnosis. Developed by the OswaldZhang team, its open-source code repository was released on GitHub on June 1, 2026.

## Research Background and Challenges

In the fields of cancer research and precision medicine, histopathology and spatial transcriptomics are core diagnostic technologies, but they have long been in silos. Traditional methods process image and genetic data separately, lacking a unified model, which limits the ability to infer molecular features and predict prognosis from pathological images.

## Model Architecture and Technical Innovations

SQUALL adopts a phased self-supervised training strategy: 1. Image feature learning; 2. Cross-modal alignment; 3. Task fine-tuning. Based on the histMol corpus (33 tissue types, 12 platforms, 1.76 billion spatial spots), its core capabilities include large-scale virtual biomarker analysis (covering 15,757 genes) and identification of prognosis-related spatial microenvironments (e.g., TLS maturation patterns).

## Clinical Application Validation Results

In breast cancer research, it identified molecular trajectories related to tumor invasion; in ovarian cancer, it recognized recurrence-related immune-excluded microenvironments; its performance in prognosis prediction tasks outperforms traditional models, and it shows balanced performance in predicting platinum chemotherapy resistance, providing support for clinical decision-making.

## Technical Implementation and Open-Source Resources

The open-source code repository includes modules for preprocessing, pre-training, fine-tuning, and inference, based on the PyTorch framework. It provides tutorials and examples, supporting the generation of virtual gene expression profiles from pathological slides and spatial microenvironment clustering analysis.

## Research Significance and Industry Impact

It breaks through the boundaries of multimodal spatial representation learning, promotes low-cost molecular typing, retrospective cohort studies, real-time diagnostic assistance, and drug response prediction, lowering the technical threshold for precision medicine.

## Limitations and Future Directions

Current limitations include insufficient coverage of training data (bias towards rare cancers) and limitations in spatial transcriptomics resolution; future plans include integrating higher-resolution data, expanding cancer types, and combining clinical information to build a comprehensive model.

## Conclusion

SQUALL integrates histopathology and spatial molecular data, advances the boundaries of computational pathology technology, provides a practical tool for precision medicine and cancer research, and is expected to play an important role in clinical practice in the future.
