Table of Contents
Fetching ...

DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

Angelos Chatzimparmpas, Rafael M. Martins, Alexandru C. Telea, Andreas Kerren

TL;DR

DeforestVis tackles interpretability of complex ML models by employing AdaBoost-derived one-level decision stumps as surrogate explanations, enabling a visual analytics workflow that trades off surrogate complexity against fidelity. The tool provides linked views to summarize surrogate behavior, compare rules, and investigate individual test cases, with capabilities to override rules and assess their global impact. Evaluations with five ML experts on real healthcare datasets show the approach is promising for understanding and communicating model behavior, while revealing limitations in scalability and the need for future enhancements such as richer fidelity metrics and bi-directional updates to the target model. Overall, DeforestVis offers a practical framework for domain experts and developers to interrogate, refine, and communicate complex model decisions through compact, interpretable surrogates.

Abstract

As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models-such as rule sets and decision trees-that sufficiently approximate the original ones while being simpler and easier-to-explain. Yet, rule sets can become very lengthy, with many if-else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal-providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that offers summarization of the behaviour of complex ML models by providing surrogate decision stumps (one-level decision trees) generated with the Adaptive Boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity versus fidelity trade-off by incrementally generating more stumps, creating attribute-based explanations with weighted stumps to justify decision making, and analysing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case-by-case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.

DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

TL;DR

DeforestVis tackles interpretability of complex ML models by employing AdaBoost-derived one-level decision stumps as surrogate explanations, enabling a visual analytics workflow that trades off surrogate complexity against fidelity. The tool provides linked views to summarize surrogate behavior, compare rules, and investigate individual test cases, with capabilities to override rules and assess their global impact. Evaluations with five ML experts on real healthcare datasets show the approach is promising for understanding and communicating model behavior, while revealing limitations in scalability and the need for future enhancements such as richer fidelity metrics and bi-directional updates to the target model. Overall, DeforestVis offers a practical framework for domain experts and developers to interrogate, refine, and communicate complex model decisions through compact, interpretable surrogates.

Abstract

As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models-such as rule sets and decision trees-that sufficiently approximate the original ones while being simpler and easier-to-explain. Yet, rule sets can become very lengthy, with many if-else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal-providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that offers summarization of the behaviour of complex ML models by providing surrogate decision stumps (one-level decision trees) generated with the Adaptive Boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity versus fidelity trade-off by incrementally generating more stumps, creating attribute-based explanations with weighted stumps to justify decision making, and analysing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case-by-case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.
Paper Structure (15 sections)