Analyzing Local Fidelity of XAI Methods for Tabular Data

Abstract

Explainable AI aims to make black-box model decisions understandable, with local explanations being a common approach for interpreting individual predictions. Local fidelity, which is the alignment between an explanation and the black-box model in the explained instance's neighborhood, is an important property of such explanations. This study investigates local fidelity of local explanations by evaluating common explanation methods on regression and classification tasks. We contextualize local fidelity with the complexity of the underlying black-box model. Additionally, we assess whether local explanations provide meaningful value over trivial baseline approaches. Furthermore, we analyze neighborhood sizes in which explanations remain accurate. Our results show a significant divergence: local fidelity is high only for simple models, where explanations may be unnecessary. Conversely, for complex models, where interpretability is essential, local explanations mostly fail to accurately capture model behavior. For classification tasks, local explanations often provide limited additional insights into model behavior within small neighborhoods around individual predictions. While absolute local fidelity values vary by method and dataset, we consistently find that explanations remain accurate only in very small neighborhoods. These findings hold significant implications for practitioners and end-users, suggesting that local explanations may offer limited value for understanding complex model behavior beyond the explained instance.
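As a rough illustration of the quantities described in the abstract, the sketch below measures local fidelity as the mean squared error between a model and a linear explanation on points sampled around the explained instance, and compares it with a trivial baseline that always predicts the model output at that instance. It is a minimal sketch under stated assumptions, not the procedure in estimate_local_fidelity.py: the toy `black_box` function, the finite-difference explanation, the Gaussian neighborhood, and all function names are hypothetical.

```python
import numpy as np


def black_box(X):
    """Hypothetical stand-in for a trained model: a smooth nonlinear function."""
    return np.sin(3.0 * X[:, 0]) + X[:, 1] ** 2


def linear_explanation(f, x, eps=1e-4):
    """First-order surrogate g(z) = f(x) + grad . (z - x), via central differences."""
    grad = np.zeros_like(x)
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = eps
        grad[j] = (f((x + step)[None, :])[0] - f((x - step)[None, :])[0]) / (2 * eps)
    intercept = f(x[None, :])[0]
    return lambda Z: intercept + (Z - x) @ grad


def local_fidelity(f, g, x, radius, n_samples=2000, seed=0):
    """MSE between model and explanation on Gaussian samples around x,
    alongside the MSE of a trivial baseline that always predicts f(x)."""
    rng = np.random.default_rng(seed)
    Z = x + radius * rng.standard_normal((n_samples, x.size))
    fz = f(Z)
    mse_explanation = float(np.mean((fz - g(Z)) ** 2))
    mse_baseline = float(np.mean((fz - f(x[None, :])[0]) ** 2))
    return mse_explanation, mse_baseline


x0 = np.array([0.5, -1.0])
g = linear_explanation(black_box, x0)

# Evaluate the same explanation in increasingly large neighborhoods.
results = {r: local_fidelity(black_box, g, x0, r) for r in (0.01, 0.1, 1.0)}
for r, (mse_g, mse_base) in results.items():
    print(f"radius={r}: explanation MSE={mse_g:.2e}, baseline MSE={mse_base:.2e}")
```

In this toy setting the explanation error grows with the neighborhood radius, which is the kind of behavior the neighborhood-size analysis examines. The actual metrics and sampling scheme used in the study may differ.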

XAI methods

LIME (default binary and continuous)
Saliency Maps
Integrated Gradients
SmoothGrad + Integrated Gradients
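For readers unfamiliar with the gradient-based methods listed above, the following is a minimal NumPy sketch of a saliency attribution (the plain gradient), Integrated Gradients (a Riemann-sum approximation of the path integral from a baseline), and SmoothGrad applied on top of Integrated Gradients. It is illustrative only: the toy `model`, the zero baseline, the finite-difference gradients, and all parameter values are assumptions, and the repository may rely on a dedicated attribution library instead.

```python
import numpy as np


def model(x):
    """Hypothetical differentiable model acting on one feature vector."""
    return np.tanh(x[0]) + x[0] * x[1] + 0.5 * x[2] ** 2


def gradient(f, x, eps=1e-5):
    """Central-difference gradient; used directly, this is a saliency attribution."""
    grad = np.zeros_like(x)
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = eps
        grad[j] = (f(x + step) - f(x - step)) / (2 * eps)
    return grad


def integrated_gradients(f, x, baseline, n_steps=200):
    """Midpoint Riemann-sum approximation of the path integral from baseline to x."""
    alphas = (np.arange(n_steps) + 0.5) / n_steps
    avg_grad = np.mean(
        [gradient(f, baseline + a * (x - baseline)) for a in alphas], axis=0
    )
    return (x - baseline) * avg_grad


def smoothgrad(attribution_fn, x, noise_std=0.1, n_samples=25, seed=0):
    """Average an attribution over Gaussian-perturbed copies of x."""
    rng = np.random.default_rng(seed)
    noisy = x + noise_std * rng.standard_normal((n_samples, x.size))
    return np.mean([attribution_fn(z) for z in noisy], axis=0)


x = np.array([0.8, -1.2, 2.0])
baseline = np.zeros_like(x)  # assumed all-zeros reference point

saliency = gradient(model, x)
ig = integrated_gradients(model, x, baseline)
smooth_ig = smoothgrad(lambda z: integrated_gradients(model, z, baseline), x)

# Completeness: IG attributions sum to f(x) - f(baseline), up to numerical error.
completeness_gap = abs(ig.sum() - (model(x) - model(baseline)))
print("saliency:", saliency)
print("integrated gradients:", ig)
print("smoothgrad + IG:", smooth_ig)
print("completeness gap:", completeness_gap)
```

The completeness check at the end is a quick sanity test for any Integrated Gradients implementation: the attributions should sum to the difference between the model output at the input and at the baseline.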

Datasets

Standard Datasets

TabularBenchmark

Synthetic Datasets

Synthetic classification data, generated with scikit-learn's sklearn.datasets.make_classification (link to dataset)
Custom synthetic regression data, generated with make_custom_regression_data
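A minimal sketch of generating a synthetic classification dataset with sklearn.datasets.make_classification is shown below. The parameter values are illustrative assumptions and are not the settings used in the experiments. The custom regression generator is specific to this repository, so its signature is not reproduced here.

```python
from sklearn.datasets import make_classification

# Illustrative parameter values only; the experiment configuration may differ.
X, y = make_classification(
    n_samples=1000,    # number of rows
    n_features=10,     # total number of columns
    n_informative=5,   # features that carry class signal
    n_redundant=2,     # linear combinations of the informative features
    n_classes=2,
    random_state=0,    # fixed seed for reproducibility
)

print(X.shape, y.shape)
```

Varying n_informative and n_redundant is one way to control how hard the task is, and therefore how complex a model has to be to fit it.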

Models

Deep Learning Models (PyTorch-Frame)

Gradient Boosting Models

XGBoost
LightGBM

Attribution

This repository contains code adapted from the Python package PyTorch Frame (PyG-team).

Original source: GitHub link to original script
License: MIT (link)
Modifications include dataset adaptation for our specific use case.
