Zing Forum

Reading

Beyond Accuracy: An Explainable AI-Driven Student Academic Performance Prediction System

This article introduces a graduation thesis research combining machine learning, deep learning, and Explainable AI (XAI), focusing on student academic performance prediction. The project uses LSTM, Bi-LSTM, and diffusion models, along with explainability methods like SHAP, LIME, and DICE, as well as the fairness assessment tool fairsynedu, to build an educational AI system that balances accuracy and explainability.

可解释AIXAILSTM学生表现预测SHAPLIME教育AI公平性扩散模型
Published 2026-06-15 14:15Recent activity 2026-06-15 14:20Estimated read 5 min
Beyond Accuracy: An Explainable AI-Driven Student Academic Performance Prediction System
1

Section 01

[Introduction] Beyond Accuracy: An Explainable AI-Driven Student Academic Performance Prediction System

This article introduces a student academic performance prediction study combining deep learning (LSTM, Bi-LSTM, diffusion models) and Explainable AI (SHAP, LIME, DICE, etc.), while integrating the fairness assessment tool fairsynedu, aiming to build an educational AI system that balances accuracy and explainability. The project source code and related materials are available on GitHub (link: https://github.com/Istiaq-Alam/Student-Performance-Prediction-with-Explainable-AI).

2

Section 02

Research Background and Core Issues

Traditional student academic prediction models often focus on accuracy but ignore decision transparency and explainability, making it impossible for educators to take effective interventions based on prediction results. The core question of this study: How to build an AI system that can accurately predict student performance while providing explainable and actionable insights?

3

Section 03

Core Technologies and Model Architecture

Deep Learning Models: Uses LSTM (captures long-term dependencies in time series), Bi-LSTM (bidirectional context), diffusion models (data augmentation and synthetic samples); Explainable AI Methods: SHAP (feature contribution), LIME (single prediction explanation), DICE (diverse counterfactuals), CEM (minimum perturbation counterfactuals); Fairness Assessment: Uses the fairsynedu tool to detect biases in educational data and avoid systemic discrimination.

4

Section 04

Dataset Description

The study uses the classic Student Performance Data Set (Cortez and Silva, 2008), which includes math and Portuguese language grades data of Portuguese middle school students, covering dimensions: demographic characteristics (age, gender), socioeconomic data (parental education/occupation), behavioral characteristics (study time, extracurricular activities), and historical performance (past grades, absences).

5

Section 05

Web Application Features and Interactive Experience

The project develops an interactive dashboard based on Flask API, with features including: 1. Prediction function (input student features to get performance predictions); 2. Explainability display (visualization of SHAP values, LIME explanations); 3. Human-machine collaborative feedback (educators can provide feedback on prediction results to continuously optimize the model).

6

Section 06

Research Contributions and Practical Value

Academic Contributions: 1. Introduce diffusion models into the field of educational prediction; 2. Systematically compare the performance of multiple XAI methods in educational scenarios; 3. Incorporate fairness assessment into the model development process. Practical Value: 1.Early warning of students at academic risk;2.Provide personalized learning recommendations;3.Provide decision support for educational managers.

7

Section 07

Limitations and Future Research Directions

Current Limitations: Geographical limitation of the dataset (Portugal), high computational cost of counterfactual explanations, controversial selection of fairness metrics. Future Directions: Multimodal data fusion (learning logs/homework patterns), real-time prediction systems, cross-cultural validation, evolution from correlation to causal inference.