InDeed: Interpretable image deep decomposition with guaranteed generalizability
Sihan Wang, Shangqi Gao, Fuping Wu, Xiahai Zhuang
TL;DR
InDeed addresses the challenge of interpretable and generalizable deep image decomposition by marrying hierarchical Bayesian modeling with deep inference in a modular architecture. It introduces a three-step framework: hierarchical decomposition of Y into $L,S,N$ with $L=AB^T$, variational inference split into two sub-problems (one closed-form, one learned via a neural network $f_ heta$), and a modular DNN that mirrors the probabilistic graph. A PAC-Bayesian generalization bound motivates test-time adaptation (InDeedAG/InDeedOAG), enabling robust performance under distribution shifts, demonstrated on image denoising and unsupervised anomaly detection with strong OOD generalization and interpretable intermediate outputs. Key contributions include closed-form leaf updates, a VI-based objective with interpretable decomposed terms, a modular architecture aligned with the HB model, and an active generalization framework that adapts efficiently at test time. The work provides practical impact for deploying interpretable, generalizable image decomposition models in real-world tasks such as denoising and defect detection, with a clear pathway for extending to other decomposition priors and transfer across tasks.
Abstract
Image decomposition aims to analyze an image into elementary components, which is essential for numerous downstream tasks and also by nature provides certain interpretability to the analysis. Deep learning can be powerful for such tasks, but surprisingly their combination with a focus on interpretability and generalizability is rarely explored. In this work, we introduce a novel framework for interpretable deep image decomposition, combining hierarchical Bayesian modeling and deep learning to create an architecture-modularized and model-generalizable deep neural network (DNN). The proposed framework includes three steps: (1) hierarchical Bayesian modeling of image decomposition, (2) transforming the inference problem into optimization tasks, and (3) deep inference via a modularized Bayesian DNN. We further establish a theoretical connection between the loss function and the generalization error bound, which inspires a new test-time adaptation approach for out-of-distribution scenarios. We instantiated the application using two downstream tasks, \textit{i.e.}, image denoising, and unsupervised anomaly detection, and the results demonstrated improved generalizability as well as interpretability of our methods. The source code will be released upon the acceptance of this paper.
