Table of Contents
Fetching ...

Interpretable Generative and Discriminative Learning for Multimodal and Incomplete Clinical Data

Albert Belenguer-Llorens, Carlos Sevilla-Salcedo, Janaina Mourao-Miranda, Vanessa Gómez-Verdejo

TL;DR

This work introduces OSIRIS, a Bayesian model that unifies generative and discriminative learning to tackle multimodal, incomplete clinical data. By maintaining two interdependent latent spaces $\mathbf{G}$ (generative) and $\mathbf{Z}$ (discriminative), OSIRIS enables automatic missing-view imputation, uncertainty quantification, and sparse latent-factor discovery, while remaining task-focused through a Bayesian logistic link to labels. The authors develop a mean-field variational inference scheme with ARD sparsity and a Taylor-based bound to handle nonconjugate terms, yielding interpretable, compact latent representations that separate task-relevant information from auxiliary structure. Empirical results across diverse datasets (including ADNI) show that OSIRIS often outperforms state-of-the-art baselines in AUC and BACC, robustly imputes missing data, and reveals clinically meaningful biomarker patterns, underscoring its potential for practical multimodal biomarker discovery and disease prognosis.

Abstract

Real-world clinical problems are often characterized by multimodal data, usually associated with incomplete views and limited sample sizes in their cohorts, posing significant limitations for machine learning algorithms. In this work, we propose a Bayesian approach designed to efficiently handle these challenges while providing interpretable solutions. Our approach integrates (1) a generative formulation to capture cross-view relationships with a semi-supervised strategy, and (2) a discriminative task-oriented formulation to identify relevant information for specific downstream objectives. This dual generative-discriminative formulation offers both general understanding and task-specific insights; thus, it provides an automatic imputation of the missing views while enabling robust inference across different data sources. The potential of this approach becomes evident when applied to the multimodal clinical data, where our algorithm is able to capture and disentangle the complex interactions among biological, psychological, and sociodemographic modalities.

Interpretable Generative and Discriminative Learning for Multimodal and Incomplete Clinical Data

TL;DR

This work introduces OSIRIS, a Bayesian model that unifies generative and discriminative learning to tackle multimodal, incomplete clinical data. By maintaining two interdependent latent spaces (generative) and (discriminative), OSIRIS enables automatic missing-view imputation, uncertainty quantification, and sparse latent-factor discovery, while remaining task-focused through a Bayesian logistic link to labels. The authors develop a mean-field variational inference scheme with ARD sparsity and a Taylor-based bound to handle nonconjugate terms, yielding interpretable, compact latent representations that separate task-relevant information from auxiliary structure. Empirical results across diverse datasets (including ADNI) show that OSIRIS often outperforms state-of-the-art baselines in AUC and BACC, robustly imputes missing data, and reveals clinically meaningful biomarker patterns, underscoring its potential for practical multimodal biomarker discovery and disease prognosis.

Abstract

Real-world clinical problems are often characterized by multimodal data, usually associated with incomplete views and limited sample sizes in their cohorts, posing significant limitations for machine learning algorithms. In this work, we propose a Bayesian approach designed to efficiently handle these challenges while providing interpretable solutions. Our approach integrates (1) a generative formulation to capture cross-view relationships with a semi-supervised strategy, and (2) a discriminative task-oriented formulation to identify relevant information for specific downstream objectives. This dual generative-discriminative formulation offers both general understanding and task-specific insights; thus, it provides an automatic imputation of the missing views while enabling robust inference across different data sources. The potential of this approach becomes evident when applied to the multimodal clinical data, where our algorithm is able to capture and disentangle the complex interactions among biological, psychological, and sociodemographic modalities.

Paper Structure

This paper contains 37 sections, 159 equations, 16 figures, 10 tables, 1 algorithm.

Figures (16)

  • Figure 1: Diagram of the graphical model of OSIRIS and a zoom into each input view. Grey circles denote observed variables, white circles unobserved variables. Blue font denotes generative variables and green font task-oriented variables. Rectangles represent node groups dependent on the view's feature space.
  • Figure 2: Evolution of OSIRIS BACC as missing value rates increase, comparing its internal imputation mechanism to external imputation methods applied as preprocessing. It displays the mean performance of each model across ten randomly generated missing masks.
  • Figure 3: Stable weight and loading matrices identified by both the task-oriented space (a) and the generative space (b). Columns represent the stable factors, while rows correspond to the features. In subfigure (a), the acronyms used denote the following: L (left), R (right), S (sulcus), and G (gyrus).
  • Figure 4: Factor that captures the variability of the bile acids measures across the sample.
  • Figure 5: Factor that captures the variability of the depression measures across the sample.
  • ...and 11 more figures