Zing Forum

Impact of Missing Data on Model Inference: A Stability Study Based on Explainable AI

A study on how missing data affects the inference process of machine learning models, using explainable AI (XAI) techniques to analyze the decision stability of models when facing incomplete data, providing a theoretical basis for data quality assessment in practical applications.

Tags: Missing Data · Explainable AI (XAI) · Model Robustness · SHAP · Machine Learning · Data Quality · Model Interpretation · Feature Importance · AI Stability
Published 2026-04-20 23:03 · Recent activity 2026-04-20 23:27 · Estimated read 9 min

Section 01

[Main Floor] Introduction to Impact of Missing Data on Model Inference: A Stability Study Based on Explainable AI

This study focuses on the prevalent missing data problem in the real world, systematically exploring its impact on the decision stability of machine learning models through explainable AI (XAI) techniques. Core viewpoints:

  • Missing data may not only reduce model accuracy but also destabilize the inference process;
  • High prediction accuracy does not necessarily mean reliable explanations;
  • Missing patterns (rather than just missing proportions) have the greater impact on models;
  • Different types of models differ significantly in their robustness to missing data.

The study provides a theoretical basis and practical framework for data quality assessment and the construction of trustworthy AI.

Section 02

Research Background and Motivation

Prevalence of Missing Data

Missing data arises for various reasons and is conventionally classified into three types: Missing Completely at Random (MCAR), Missing at Random (MAR), and Missing Not at Random (MNAR). Traditional handling methods (such as deletion and mean imputation) focus on completing the data but ignore how the missingness itself affects model inference.
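The two traditional strategies named above can be sketched in a few lines of NumPy (a minimal illustration of the general techniques, not the study's own code):

```python
import numpy as np

def mean_impute(X):
    """Replace each NaN with its column mean (simple mean imputation)."""
    X = X.astype(float).copy()
    col_means = np.nanmean(X, axis=0)
    rows, cols = np.where(np.isnan(X))
    X[rows, cols] = col_means[cols]
    return X

def listwise_delete(X):
    """Drop every row that contains at least one NaN (listwise deletion)."""
    return X[~np.isnan(X).any(axis=1)]

X = np.array([[1.0, np.nan],
              [3.0, 4.0]])
mean_impute(X)      # → [[1., 4.], [3., 4.]]
listwise_delete(X)  # → [[3., 4.]]
```

Both methods produce a complete matrix, which is exactly why they can mask the inference-level effects the study is concerned with: the model never sees which values were observed and which were filled in.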

Rise of Explainable AI

The application of deep learning in key fields has promoted the development of XAI techniques (e.g., SHAP, LIME), which can reveal the decision logic of models.

Research Innovation

This study extends XAI from explaining individual predictions to evaluating inference stability under missing data, filling gaps in related fields.

Section 03

Research Methodology

Core Questions

  1. Changes in prediction accuracy;
  2. Stability of feature importance ranking;
  3. Decision boundary drift;
  4. Maintenance of explanation consistency.

Experimental Design

  • Datasets: UCI medical, financial, and social survey datasets (10-100+ dimensions, thousands to tens of thousands of samples);
  • Model Types: Traditional ML (random forest, SVM, etc.), deep learning (MLP, etc.), ensemble models (XGBoost, etc.);
  • Missing Simulation: Random feature missing (10%/30%/50%/70%), structured missing, progressive missing;
  • XAI Methods: SHAP, Permutation Importance, Partial Dependence Plots.
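The random-missingness setting above (10%–70% of entries masked completely at random) can be simulated with a small helper like this (an illustrative sketch; the study's own simulation code is not shown):

```python
import numpy as np

def apply_mcar(X, rate, seed=None):
    """Mask a fraction `rate` of entries uniformly at random (MCAR)."""
    rng = np.random.default_rng(seed)
    X = X.astype(float).copy()
    X[rng.random(X.shape) < rate] = np.nan
    return X

X = np.ones((1000, 10))
for rate in (0.1, 0.3, 0.5, 0.7):   # the four missing levels used in the study
    Xm = apply_mcar(X, rate, seed=42)
    print(rate, round(np.isnan(Xm).mean(), 3))
```

Structured and progressive missingness would instead mask whole columns, correlated blocks, or a growing fraction of entries; MCAR is the simplest of the three schemes listed.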

Stability Indicators

  • Prediction stability: Prediction variance, confidence change;
  • Explanation stability: Kendall Tau coefficient (feature ranking), Jensen-Shannon divergence (SHAP distribution);
  • Decision boundary stability: Geometric change, adversarial sample sensitivity.
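The two explanation-stability indicators can be computed as follows (a self-contained sketch with a naive O(n²) Kendall implementation; in practice one would use `scipy.stats.kendalltau` on feature-importance vectors and compare normalized mean-|SHAP| distributions):

```python
import numpy as np
from itertools import combinations

def kendall_tau(a, b):
    """Kendall rank correlation between two score vectors (no tie handling)."""
    n = len(a)
    s = sum(np.sign(a[i] - a[j]) * np.sign(b[i] - b[j])
            for i, j in combinations(range(n), 2))
    return 2 * s / (n * (n - 1))

def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two normalized distributions."""
    p = np.asarray(p, float) / np.sum(p)
    q = np.asarray(q, float) / np.sum(q)
    m = 0.5 * (p + q)
    kl = lambda a, b: np.sum(a * np.log((a + eps) / (b + eps)))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

A Kendall tau near 1 means the feature ranking survived the missingness intact; a JS divergence near 0 means the attribution mass stayed on the same features. Both dropping while accuracy holds is exactly the decoupling the study reports.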

Section 04

Key Findings

Finding 1: Decoupling of Accuracy and Explanation Stability

Models may maintain high accuracy while their explanations become unstable (e.g., a medical model's AUC drops by only 0.07, yet its feature ranking changes substantially), suggesting that such models may be relying on spurious correlations.

Finding 2: Differences in Model Robustness

  • Tree ensemble models: Robust to random missingness but sensitive to structured missingness, with feature importances prone to abrupt spikes;
  • Neural networks: Sensitive to missing continuous features, with explanation stability degrading faster, though they can partially compensate through learned feature representations.

Finding 3: Feature Importance Illusion

When key features are missing, models shift weights to proxy variables or amplify noise features, leading to incorrect explanations (e.g., relying on non-causal features in medical diagnosis).

Finding 4: Missing Patterns Matter More

Losing 10% of key features harms the model more than losing 50% of marginal features; structured missingness is more damaging than random missingness; and MNAR patterns introduce the most severe bias.

Section 05

Practical Implications and Recommendations

For Model Developers

  1. Incorporate missing robustness into model selection criteria, simulate missing scenarios to evaluate stability curves;
  2. Monitor XAI stability indicators, trigger alerts when feature ranking changes significantly;
  3. Quantify uncertainty (e.g., Bayesian NN, MC Dropout).
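Recommendation 3 can be illustrated with a minimal Monte Carlo dropout pass in NumPy (a toy two-layer network with hypothetical random weights, not the study's architecture): dropout is kept active at inference, and many stochastic forward passes are aggregated into a mean prediction plus an uncertainty estimate.

```python
import numpy as np

def mc_dropout_predict(x, W1, W2, n_passes=200, p_drop=0.5, seed=None):
    """Monte Carlo dropout: keep dropout ON at inference and average
    many stochastic forward passes; the spread estimates uncertainty."""
    rng = np.random.default_rng(seed)
    preds = []
    for _ in range(n_passes):
        h = np.maximum(x @ W1, 0.0)               # ReLU hidden layer
        keep = rng.random(h.shape) >= p_drop      # random dropout mask
        h = h * keep / (1.0 - p_drop)             # inverted-dropout scaling
        preds.append(h @ W2)
    preds = np.stack(preds)
    return preds.mean(axis=0), preds.std(axis=0)  # prediction, uncertainty

rng = np.random.default_rng(0)
x  = rng.normal(size=(5, 8))    # 5 records, 8 features (toy data)
W1 = rng.normal(size=(8, 16))   # hypothetical trained weights
W2 = rng.normal(size=(16,))
mean, std = mc_dropout_predict(x, W1, W2, seed=1)
```

Records whose `std` is large (e.g., because imputed values push them into unfamiliar regions) are exactly the ones a monitoring system should flag rather than auto-predict.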

For Business Applications

  1. Build a data quality dashboard to monitor missing ratios and patterns;
  2. Layered decision-making: Direct prediction for high completeness, manual review for medium completeness, reject prediction for low completeness;
  3. Label explanation credibility, remind of potential risks when data is incomplete.
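The layered policy in point 2 reduces to a small routing rule; the completeness thresholds below are illustrative assumptions, not values from the study:

```python
def route_by_completeness(missing_frac, predict_max=0.1, review_max=0.5):
    """Tiered decision policy keyed to a record's missing-value fraction.
    Thresholds are illustrative, not taken from the study."""
    if missing_frac <= predict_max:
        return "predict"        # high completeness: automatic prediction
    if missing_frac <= review_max:
        return "manual_review"  # medium completeness: route to a human
    return "reject"             # low completeness: refuse to predict

route_by_completeness(0.05)  # → 'predict'
route_by_completeness(0.30)  # → 'manual_review'
route_by_completeness(0.80)  # → 'reject'
```

In line with Finding 4, a production version would also weight the thresholds by which features are missing, since a small gap in key features can matter more than a large gap in marginal ones.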

Section 06

Limitations and Future Directions

Current Limitations

  1. Static analysis: Does not involve the impact of missing data during the training process;
  2. Feature independence assumption: Does not fully consider complex correlations in real data;
  3. XAI method limitations: SHAP and others have computational costs and approximation errors.

Future Directions

  1. Active missing handling: Decide which features to collect (balance cost and gain);
  2. Causal perspective: Distinguish between correlation and causality to avoid explanation illusions;
  3. Dynamic missing adaptation: Develop adaptive models to adjust inference strategies;
  4. Human-in-the-loop: Optimize feature query strategies (reduce cognitive burden).

Section 07

Summary

This study reveals the deep impact of missing data on model inference through rigorous experiments: it not only reduces accuracy but also undermines inference stability. The core conclusion is that high accuracy does not equal reliable explanation; in key applications, both prediction performance and explanation stability need to be monitored simultaneously. Data quality should run through the entire model lifecycle, and understanding model behavior when information is incomplete is a necessary step to build trustworthy AI. This study provides warnings and evaluation frameworks for related practices.