# Multimodal Heart Failure Prediction System Based on Graph Neural Networks

> This article introduces an innovative framework for heart failure prediction that integrates ECG images and clinical data using graph neural networks, and details its technical architecture, multimodal fusion strategy, and clinical application value.

- Board: [Openclaw Geo](https://www.zingnex.cn/en/forum/board/openclaw-geo)
- Published: 2026-06-04T16:14:03.000Z
- Last activity: 2026-06-04T16:20:51.032Z
- Popularity: 159.9
- Keywords: graph neural networks, heart failure prediction, multimodal learning, ECG analysis, medical AI, PyTorch Geometric, ResNet, deep learning
- Page link: https://www.zingnex.cn/en/forum/thread/geo-github-zahrazahraboua-heart-failure-prediction-gnn
- Canonical: https://www.zingnex.cn/forum/thread/geo-github-zahrazahraboua-heart-failure-prediction-gnn
- Markdown source: floors_fallback

---

## Original Author and Source

- **Original Author/Maintainer:** Zahra BOUAOUNE
- **Supervising Professor:** Prof. Dr. Ouarda ZEDADRA
- **Source Platform:** GitHub
- **Original Title:** Heart-Failure-Prediction-GNN
- **Original Link:** https://github.com/zahrazahraboua/Heart-Failure-Prediction-GNN
- **Academic Source:** Master's thesis, Department of Computer Science, University of 8 May 1945 Guelma (Algeria), with the defense scheduled for June 2025
- **Full Thesis:** https://dspace.univ-guelma.dz/jspui/bitstream/123456789/18261/1/F5_8_BOUAOUNE_ZAHRA_1751928035.pdf

---

## Research Background and Problem Definition

Heart failure is one of the leading causes of hospitalization and death worldwide. Traditional diagnosis relies mainly on clinicians' judgment and on single-modality examination data, which makes it difficult to use the full range of a patient's health information. With advances in deep learning, integrating ECG image data with numerical clinical indicators to build more accurate prediction models has become an important research direction in medical AI.

The core question of this study is: how can the non-Euclidean structure of patient relationships be brought into a deep learning framework, so that a Graph Neural Network (GNN) captures the similarities between patients and improves the accuracy of heart failure prediction?

---

## Technical Architecture and Overall Workflow

This project builds an end-to-end multimodal graph deep learning pipeline whose overall architecture comprises seven key stages:

## 1. Data Cleaning and Validation

The system first filters out corrupted files and checks that every input image meets the minimum size requirement (100×100 pixels), so that later stages receive only usable data.
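
The check described above can be sketched with Pillow. This is a minimal illustration, not the project's actual code: the function names `is_valid_image` and `filter_images` are hypothetical, and the use of Pillow is an assumption, since the text does not say which library performs the validation.

```python
from pathlib import Path

from PIL import Image

MIN_SIZE = 100  # minimum width and height in pixels, as stated in the text


def is_valid_image(path, min_size=MIN_SIZE):
    """Return True if the file decodes as an image of at least min_size x min_size."""
    try:
        with Image.open(path) as img:
            img.verify()  # integrity check; the image object is unusable afterwards
        with Image.open(path) as img:  # reopen to read the dimensions
            width, height = img.size
    except (OSError, SyntaxError, ValueError):
        # Unreadable, truncated, or non-image files are treated as corrupted.
        return False
    return width >= min_size and height >= min_size


def filter_images(folder, min_size=MIN_SIZE):
    """Return the sorted paths of all valid images directly inside `folder`."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and is_valid_image(p, min_size)
    )
```

Calling `filter_images("ecg_images/")` (a hypothetical folder name) would return only the files that both decode cleanly and meet the size threshold.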

## 2. Class Balance and Data Augmentation

To address class imbalance in the dataset (normal vs. patient samples), the minority class is expanded with ImageDataGenerator. Rotation, translation, scaling, and horizontal flipping generate new samples until the two classes contain a balanced number of images.
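
The balancing idea can be illustrated without Keras. The sketch below is a NumPy stand-in for ImageDataGenerator, not the project's code: it applies only a random horizontal flip and a small wrap-around shift (rotation and scaling are omitted), and the function names and the 5-pixel shift range are assumptions made for illustration.

```python
import numpy as np


def augment(image, rng):
    """Apply a random horizontal flip and a small wrap-around shift to one image."""
    out = image[:, ::-1] if rng.random() < 0.5 else image
    dy, dx = rng.integers(-5, 6, size=2)  # shift range is an illustrative assumption
    return np.roll(out, shift=(int(dy), int(dx)), axis=(0, 1))


def balance_minority(minority, majority_count, seed=0):
    """Append augmented copies to `minority` until it has `majority_count` samples."""
    rng = np.random.default_rng(seed)
    n_missing = max(0, majority_count - len(minority))
    if n_missing == 0:
        return minority
    # Pick random source images (with replacement) and augment each one.
    picks = rng.integers(0, len(minority), size=n_missing)
    extra = np.stack([augment(minority[i], rng) for i in picks])
    return np.concatenate([minority, extra], axis=0)
```

The original images are kept unchanged at the front of the returned array, so only the added samples are synthetic.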

## 3. Clinical Feature Engineering

Five key medical variables that reflect a patient's cardiac function are generated synthetically by sampling from class-specific statistical distributions:
- **Ejection Fraction (EF):** Normal distribution, mean 0.60 for healthy group, mean 0.38 for pathological group
- **Brain Natriuretic Peptide (BNP):** Log-normal distribution, mean 4.2 for healthy group, mean 5.8 for pathological group
- **NYHA Cardiac Function Classification:** Categorical variable (levels 1-4), weighted by the severity of heart failure
- **Systolic Blood Pressure (SBP):** Normal distribution, mean 125 mmHg for healthy group, mean 110 mmHg for pathological group
- **Age:** Continuous integer sampling, mean 55 years for healthy group, mean 68 years for pathological group
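
A sampling routine along these lines might look as follows. Only the group means come from the text. Every standard deviation, the EF clipping range, the NYHA class weights, the rounded-normal choice for age, and the reading of the BNP means as log-scale parameters are assumptions made for this sketch; `PARAMS` and `sample_clinical` are hypothetical names.

```python
import numpy as np

# Means are taken from the text; all spreads and NYHA weights are ASSUMED.
PARAMS = {
    0: dict(ef=(0.60, 0.05), bnp=(4.2, 0.4), sbp=(125, 12), age=(55, 10),
            nyha=[0.70, 0.25, 0.04, 0.01]),   # healthy group
    1: dict(ef=(0.38, 0.08), bnp=(5.8, 0.5), sbp=(110, 15), age=(68, 9),
            nyha=[0.05, 0.25, 0.45, 0.25]),   # pathological group
}


def sample_clinical(label, n, seed=0):
    """Return an (n, 5) array with columns EF, BNP, NYHA, SBP, age for one class."""
    rng = np.random.default_rng(seed)
    p = PARAMS[label]
    ef = np.clip(rng.normal(*p["ef"], size=n), 0.05, 0.85)  # keep EF plausible
    # BNP means are interpreted as the mean of log(BNP) (assumption).
    bnp = rng.lognormal(mean=p["bnp"][0], sigma=p["bnp"][1], size=n)
    nyha = rng.choice([1, 2, 3, 4], size=n, p=p["nyha"])
    sbp = rng.normal(*p["sbp"], size=n)
    age = np.rint(rng.normal(*p["age"], size=n))  # rounded to whole years
    return np.column_stack([ef, bnp, nyha, sbp, age])
```

Because the features are drawn per class label, each image can be paired with a clinical vector from the distribution that matches its diagnosis.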

## 4. Multimodal Feature Fusion

This is the core technical step of the system. For visual feature extraction, a pre-trained ResNet18 backbone (with its final fully connected classification layer removed) generates a 512-dimensional embedding for each ECG image. The 5-dimensional clinical feature vector is standardized with StandardScaler and then concatenated directly with the visual features, giving a 517-dimensional node feature vector (512 + 5).
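
The fusion arithmetic can be checked with a small NumPy sketch. It does not run ResNet18: random vectors stand in for the 512-dimensional embeddings, and the z-scoring is written by hand to mirror what StandardScaler computes (per-column mean and population standard deviation). The function name `fuse_features` and all sample values are assumptions for illustration.

```python
import numpy as np


def fuse_features(visual, clinical):
    """Z-score the clinical columns and append them to the visual embeddings."""
    mean = clinical.mean(axis=0)
    std = clinical.std(axis=0)   # population std (ddof=0), as StandardScaler uses
    std[std == 0] = 1.0          # leave constant columns unscaled
    clinical_z = (clinical - mean) / std
    return np.concatenate([visual, clinical_z], axis=1)


rng = np.random.default_rng(0)
visual = rng.normal(size=(8, 512))  # stand-in for ResNet18 embeddings
clinical = rng.normal(loc=[0.5, 100, 2, 120, 60],
                      scale=[0.1, 50, 1, 10, 8], size=(8, 5))
nodes = fuse_features(visual, clinical)
print(nodes.shape)  # (8, 517)
```

Standardizing before concatenation matters here because the raw clinical values (for example SBP near 120) are on a very different scale from the image embeddings and would otherwise dominate any distance computed between nodes.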
