Table of Contents
Fetching ...

What do we need to build explainable AI systems for the medical domain?

Andreas Holzinger, Chris Biemann, Constantinos S. Pattichis, Douglas B. Kell

TL;DR

The article reviews why explainable-AI is crucial in medicine and surveys a range of strategies to make AI decisions transparent across imaging, omics, and text data. It distinguishes ante-hoc and post-hoc explainability, illustrating methods from model-agnostic explanations to glass-box interpretable models, and discusses practical implementations in medical domains. A key emphasis is placed on AM-FM decompositions, hybrid symbolic-neural approaches, and human-in-the-loop interfaces to enhance trust and regulatory compliance. The work highlights regulatory pressures (e.g., GDPR), the need for task-relevant explanations, and future directions toward interpretable, Interactive clinical decision support systems.

Abstract

Artificial intelligence (AI) generally and machine learning (ML) specifically demonstrate impressive practical success in many different application domains, e.g. in autonomous driving, speech recognition, or recommender systems. Deep learning approaches, trained on extremely large data sets or using reinforcement learning methods have even exceeded human performance in visual tasks, particularly on playing games such as Atari, or mastering the game of Go. Even in the medical domain there are remarkable results. The central problem of such models is that they are regarded as black-box models and even if we understand the underlying mathematical principles, they lack an explicit declarative knowledge representation, hence have difficulty in generating the underlying explanatory structures. This calls for systems enabling to make decisions transparent, understandable and explainable. A huge motivation for our approach are rising legal and privacy aspects. The new European General Data Protection Regulation entering into force on May 25th 2018, will make black-box approaches difficult to use in business. This does not imply a ban on automatic learning approaches or an obligation to explain everything all the time, however, there must be a possibility to make the results re-traceable on demand. In this paper we outline some of our research topics in the context of the relatively new area of explainable-AI with a focus on the application in medicine, which is a very special domain. This is due to the fact that medical professionals are working mostly with distributed heterogeneous and complex sources of data. In this paper we concentrate on three sources: images, *omics data and text. We argue that research in explainable-AI would generally help to facilitate the implementation of AI/ML in the medical domain, and specifically help to facilitate transparency and trust.

What do we need to build explainable AI systems for the medical domain?

TL;DR

The article reviews why explainable-AI is crucial in medicine and surveys a range of strategies to make AI decisions transparent across imaging, omics, and text data. It distinguishes ante-hoc and post-hoc explainability, illustrating methods from model-agnostic explanations to glass-box interpretable models, and discusses practical implementations in medical domains. A key emphasis is placed on AM-FM decompositions, hybrid symbolic-neural approaches, and human-in-the-loop interfaces to enhance trust and regulatory compliance. The work highlights regulatory pressures (e.g., GDPR), the need for task-relevant explanations, and future directions toward interpretable, Interactive clinical decision support systems.

Abstract

Artificial intelligence (AI) generally and machine learning (ML) specifically demonstrate impressive practical success in many different application domains, e.g. in autonomous driving, speech recognition, or recommender systems. Deep learning approaches, trained on extremely large data sets or using reinforcement learning methods have even exceeded human performance in visual tasks, particularly on playing games such as Atari, or mastering the game of Go. Even in the medical domain there are remarkable results. The central problem of such models is that they are regarded as black-box models and even if we understand the underlying mathematical principles, they lack an explicit declarative knowledge representation, hence have difficulty in generating the underlying explanatory structures. This calls for systems enabling to make decisions transparent, understandable and explainable. A huge motivation for our approach are rising legal and privacy aspects. The new European General Data Protection Regulation entering into force on May 25th 2018, will make black-box approaches difficult to use in business. This does not imply a ban on automatic learning approaches or an obligation to explain everything all the time, however, there must be a possibility to make the results re-traceable on demand. In this paper we outline some of our research topics in the context of the relatively new area of explainable-AI with a focus on the application in medicine, which is a very special domain. This is due to the fact that medical professionals are working mostly with distributed heterogeneous and complex sources of data. In this paper we concentrate on three sources: images, *omics data and text. We argue that research in explainable-AI would generally help to facilitate the implementation of AI/ML in the medical domain, and specifically help to facilitate transparency and trust.

Paper Structure

This paper contains 11 sections, 7 equations, 4 figures.

Figures (4)

  • Figure 1: Four cases illustrating how the "expert" $p(\boldsymbol{x})$ affects the prototype $\boldsymbol{x}^\star$ found by activation maximization. The horizontal axis represents the input space, and the vertical axis represents the probability (extreme case a: expert is absent, extreme case d: expert is overfitted; Image source: MontavonSamekMueller:2017:InterpertingDL.
  • Figure 2: Multi-scale AM-FM decomposition based on fixed scales (non-adaptive). In the top three rows, we have images from a symptomatic plaque. In the bottom two rows is an asymptomatic example. int = intensity
  • Figure 3: SEM (Scanning Electron Microscopy) images of diabetes plasma with corresponding dominant Gabor filters. (a) and (c) Original nice spaghetti-like images in healthy controls. (b) and (d) Dominant Gabor filters for (a) and (c) respectively. (g) Dense matted deposits (DMDs) in type 2 diabetes that are removed (e) when we add small amounts of lipopolysaccharide-binding protein (LBP). (f) and (h) Dominant Gabor filters for (e) and (g) respectively. The dominant Gabor filters are shown in the frequency domain. SEM images taken from 2017 Nature Scientific Reports Diabetes and Control Data, Figure 7A.
  • Figure 4: Output of an unsupervised interpretable model for text interpretation for the input: "diabetes plasma is from blood transfusions with high sugar" (Note that it brings up plasma (material) not plasma (display device)! Image created online via ltbev.informatik.uni-hamburg.de/wsd on 27.12.2017, 19:30