Zing Forum

Reading

LLMFuxian: Reproduction of Multimodal Large Model for Mechanical Fault Diagnosis in Intelligent Operation and Maintenance

Code repository for reproducing the paper 'Multimodal data-enabled large model for machine fault diagnosis towards intelligent operation and maintenance', focusing on single-dataset experiments using the HUSTbearing dataset.

故障诊断多模态大模型智能运维轴承数据集知识图谱零样本学习PyTorch工业AI
Published 2026-05-23 16:47Recent activity 2026-05-23 16:55Estimated read 5 min
LLMFuxian: Reproduction of Multimodal Large Model for Mechanical Fault Diagnosis in Intelligent Operation and Maintenance
1

Section 01

Introduction / Main Floor: LLMFuxian: Reproduction of Multimodal Large Model for Mechanical Fault Diagnosis in Intelligent Operation and Maintenance

Code repository for reproducing the paper 'Multimodal data-enabled large model for machine fault diagnosis towards intelligent operation and maintenance', focusing on single-dataset experiments using the HUSTbearing dataset.

2

Section 02

Original Author and Source

  • Original Author/Maintainer: wangwen2077-boop
  • Source Platform: GitHub
  • Original Title: llmfuxian
  • Original Link: https://github.com/wangwen2077-boop/llmfuxian
  • Release Date: May 23, 2026
  • Tech Stack: Python, PyTorch
  • Original Paper Title: Multimodal data-enabled large model for machine fault diagnosis towards intelligent operation and maintenance
3

Section 03

Project Overview

LLMFuxian is an academic paper reproduction project aimed at reproducing the methods in the paper 'Multimodal data-enabled large model for machine fault diagnosis towards intelligent operation and maintenance'. This project focuses on the field of intelligent operation and maintenance of industrial equipment, using multimodal large model technology to achieve automatic diagnosis of mechanical faults.

Currently, it has mainly completed the reproduction of single-dataset experiments using the HUSTbearing dataset from the paper, and does not yet include the gear dataset, FRC/CAME external baselines, or complete ablation experiments.

4

Section 04

Challenges in Industrial Equipment Fault Diagnosis

In modern industrial production, the health status of rotating machinery (such as bearings and gearboxes) directly affects production efficiency and safety. Traditional fault diagnosis methods face the following challenges:

  1. Data Heterogeneity: Various sensor data (vibration signals, temperature, current, etc.) have different formats
  2. Complex Fault Types: From single faults to compound faults, with diverse patterns
  3. Scarce Labeled Data: Fault samples are difficult to obtain, especially in compound fault scenarios
  4. Cross-Working Condition Generalization: Difficulties in model transfer under different speed and load conditions
5

Section 05

Solution Approach of Multimodal Large Model

The LMM-FD (Large Multimodal Model for Fault Diagnosis) method proposed in the paper combines the knowledge understanding ability of large language models with time series analysis, and enhances entity representation through knowledge graphs to achieve zero-shot compound fault diagnosis capability.

6

Section 06

Current Reproduction Status

The project has completed the following core functions:

7

Section 07

Data Preprocessing

  • HUSTbearing data reading and parsing
  • Tab-delimited text parsing of original .xls files
  • Filtering of three working conditions: 20Hz / 40Hz / 60Hz
  • Window segmentation processing of vibration signals for X/Y/Z three axes
8

Section 08

Dataset Division Strategy

A carefully designed training/test division strategy is adopted:

  • Training/Test Data: Healthy state (healthy) + single fault (single-fault)
  • Zero-shot Validation: Compound fault (compound fault) time series are isolated separately

This design simulates real industrial scenarios—compound fault samples are scarce, and the model needs to perform zero-shot inference on unseen compound fault types.