Table of Contents
Fetching ...

Conformal Prediction for Multimodal Regression

Alexis Bose, Jonathan Ethier, Paul Guinand

TL;DR

This paper highlights the potential for internal neural network features, extracted from convergence points where multimodal information is combined, to be used by conformal prediction to construct prediction intervals (PIs), paving new paths for deploying conformal prediction in domains abundant with multimodal data.

Abstract

This paper introduces multimodal conformal regression. Traditionally confined to scenarios with solely numerical input features, conformal prediction is now extended to multimodal contexts through our methodology, which harnesses internal features from complex neural network architectures processing images and unstructured text. Our findings highlight the potential for internal neural network features, extracted from convergence points where multimodal information is combined, to be used by conformal prediction to construct prediction intervals (PIs). This capability paves new paths for deploying conformal prediction in domains abundant with multimodal data, enabling a broader range of problems to benefit from guaranteed distribution-free uncertainty quantification.

Conformal Prediction for Multimodal Regression

TL;DR

This paper highlights the potential for internal neural network features, extracted from convergence points where multimodal information is combined, to be used by conformal prediction to construct prediction intervals (PIs), paving new paths for deploying conformal prediction in domains abundant with multimodal data.

Abstract

This paper introduces multimodal conformal regression. Traditionally confined to scenarios with solely numerical input features, conformal prediction is now extended to multimodal contexts through our methodology, which harnesses internal features from complex neural network architectures processing images and unstructured text. Our findings highlight the potential for internal neural network features, extracted from convergence points where multimodal information is combined, to be used by conformal prediction to construct prediction intervals (PIs). This capability paves new paths for deploying conformal prediction in domains abundant with multimodal data, enabling a broader range of problems to benefit from guaranteed distribution-free uncertainty quantification.

Paper Structure

This paper contains 20 sections, 36 figures, 2 tables.

Figures (36)

  • Figure 1: DTU RSRP Correctional Neural Network Thrane2020, nguyen2023deep
  • Figure 2: DTU RSRP dataset counts
  • Figure 3: Multimodal Toolkit architecture gu-budhkar-2021-package
  • Figure 4: Multimodal Toolkit dataset counts and descriptions
  • Figure 5: PIs from image-only model using internal image features
  • ...and 31 more figures