Explainability in Generative Medical Diffusion Models: A Faithfulness-Based Analysis on MRI Synthesis
Surjo Dey, Pallabi Saikia
TL;DR
This paper tackles the limited explainability of diffusion-based MRI synthesis by introducing a faithfulness-based prototype framework that links diffusion-generated features to training prototypes. It combines Denoising Diffusion Probabilistic Models (DDPMs) with prototype-based networks PPNet, Enhanced ProtoPNet (EPPNet), and ProtoPool, and defines a Faithfulness Score to quantify alignment between generated outputs and training data. Empirical results on the DUKE Breast MRI dataset show high image fidelity (PSNR $=19.37\pm1.67$, SSIM $=0.6530\pm0.1052$, LPIPS $=0.2893\pm0.1050$) and demonstrate that EPPNet yields the strongest faithfulness ($F=0.1534$) among the methods. The findings indicate diffusion models can be both accurate and transparent, advancing safe and trustworthy AI applications in medical imaging by revealing the denoising trajectory and prototype associations that drive MRI synthesis.
Abstract
This study investigates the explainability of generative diffusion models in the context of medical imaging, focusing on Magnetic resonance imaging (MRI) synthesis. Although diffusion models have shown strong performance in generating realistic medical images, their internal decision making process remains largely opaque. We present a faithfulness-based explainability framework that analyzes how prototype-based explainability methods like ProtoPNet (PPNet), Enhanced ProtoPNet (EPPNet), and ProtoPool can link the relationship between generated and training features. Our study focuses on understanding the reasoning behind image formation through denoising trajectory of diffusion model and subsequently prototype explainability with faithfulness analysis. Experimental analysis shows that EPPNet achieves the highest faithfulness (with score 0.1534), offering more reliable insights, and explainability into the generative process. The results highlight that diffusion models can be made more transparent and trustworthy through faithfulness-based explanations, contributing to safer and more interpretable applications of generative AI in healthcare.
