Table of Contents
Fetching ...

Are Linear Regression Models White Box and Interpretable?

Ahmed M Salih, Yuhe Wang

TL;DR

Are Linear Regression Models White Box and Interpretable? challenges the prevailing view that linear regression is inherently transparent within XAI. It surveys core XAI concepts, identifies eight challenges that can undermine LRM interpretability, and offers concrete recommendations to treat LRMs with comparable explainability rigor as complex models. The work emphasizes the need for local explanations, cautious interpretation of coefficients under collinearity and covariates, improved uncertainty quantification, post-hoc attribution, and fairness considerations to make LRMs reliable in high-stakes applications.

Abstract

Explainable artificial intelligence (XAI) is a set of tools and algorithms that applied or embedded to machine learning models to understand and interpret the models. They are recommended especially for complex or advanced models including deep neural network because they are not interpretable from human point of view. On the other hand, simple models including linear regression are easy to implement, has less computational complexity and easy to visualize the output. The common notion in the literature that simple models including linear regression are considered as "white box" because they are more interpretable and easier to understand. This is based on the idea that linear regression models have several favorable outcomes including the effect of the features in the model and whether they affect positively or negatively toward model output. Moreover, uncertainty of the model can be measured or estimated using the confidence interval. However, we argue that this perception is not accurate and linear regression models are not easy to interpret neither easy to understand considering common XAI metrics and possible challenges might face. This includes linearity, local explanation, multicollinearity, covariates, normalization, uncertainty, features contribution and fairness. Consequently, we recommend the so-called simple models should be treated equally to complex models when it comes to explainability and interpretability.

Are Linear Regression Models White Box and Interpretable?

TL;DR

Are Linear Regression Models White Box and Interpretable? challenges the prevailing view that linear regression is inherently transparent within XAI. It surveys core XAI concepts, identifies eight challenges that can undermine LRM interpretability, and offers concrete recommendations to treat LRMs with comparable explainability rigor as complex models. The work emphasizes the need for local explanations, cautious interpretation of coefficients under collinearity and covariates, improved uncertainty quantification, post-hoc attribution, and fairness considerations to make LRMs reliable in high-stakes applications.

Abstract

Explainable artificial intelligence (XAI) is a set of tools and algorithms that applied or embedded to machine learning models to understand and interpret the models. They are recommended especially for complex or advanced models including deep neural network because they are not interpretable from human point of view. On the other hand, simple models including linear regression are easy to implement, has less computational complexity and easy to visualize the output. The common notion in the literature that simple models including linear regression are considered as "white box" because they are more interpretable and easier to understand. This is based on the idea that linear regression models have several favorable outcomes including the effect of the features in the model and whether they affect positively or negatively toward model output. Moreover, uncertainty of the model can be measured or estimated using the confidence interval. However, we argue that this perception is not accurate and linear regression models are not easy to interpret neither easy to understand considering common XAI metrics and possible challenges might face. This includes linearity, local explanation, multicollinearity, covariates, normalization, uncertainty, features contribution and fairness. Consequently, we recommend the so-called simple models should be treated equally to complex models when it comes to explainability and interpretability.
Paper Structure (14 sections, 2 equations, 8 figures)

This paper contains 14 sections, 2 equations, 8 figures.

Figures (8)

  • Figure 1: Linear vs non-linear association.
  • Figure 2: Global explanation vs local explanation.
  • Figure 3: Multicollinearity and independent features.
  • Figure 4: The association of covariates with the input and the output.
  • Figure 5: Data pre-processing.
  • ...and 3 more figures