Table of Contents
Fetching ...

Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

Luke Chen, Junyao Wang, Trier Mortlock, Pramod Khargonekar, Mohammad Abdullah Al Faruque

TL;DR

HyperDUM introduces a deterministic, hyperdimensional approach to quantify feature-level epistemic uncertainty in multimodal autonomous vehicle perception. By projecting latent features into hyperdimensional space and forming prototypes, it estimates uncertainty via prototype–feature similarity, with channel-wise (CPB) and patch-wise (PPB) mechanisms preserving per-channel and spatial uncertainty. Pre-fusion uncertainty weighting combined with these prototypes yields consistent improvements in 3D object detection and semantic segmentation under diverse and adversarial conditions, while dramatically reducing FLOPs and parameter counts compared to Bayesian baselines. The method demonstrates robustness to distribution shifts, offers practical efficiency for real-time AV systems, and is supported by comprehensive ablations, cost analyses, and supplemental theory on hyperdimensional expressivity and uncertainty quantifiability.

Abstract

Uncertainty Quantification (UQ) is crucial for ensuring the reliability of machine learning models deployed in real-world autonomous systems. However, existing approaches typically quantify task-level output prediction uncertainty without considering epistemic uncertainty at the multimodal feature fusion level, leading to sub-optimal outcomes. Additionally, popular uncertainty quantification methods, e.g., Bayesian approximations, remain challenging to deploy in practice due to high computational costs in training and inference. In this paper, we propose HyperDUM, a novel deterministic uncertainty method (DUM) that efficiently quantifies feature-level epistemic uncertainty by leveraging hyperdimensional computing. Our method captures the channel and spatial uncertainties through channel and patch -wise projection and bundling techniques respectively. Multimodal sensor features are then adaptively weighted to mitigate uncertainty propagation and improve feature fusion. Our evaluations show that HyperDUM on average outperforms the state-of-the-art (SOTA) algorithms by up to 2.01%/1.27% in 3D Object Detection and up to 1.29% improvement over baselines in semantic segmentation tasks under various types of uncertainties. Notably, HyperDUM requires 2.36x less Floating Point Operations and up to 38.30x less parameters than SOTA methods, providing an efficient solution for real-world autonomous systems.

Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

TL;DR

HyperDUM introduces a deterministic, hyperdimensional approach to quantify feature-level epistemic uncertainty in multimodal autonomous vehicle perception. By projecting latent features into hyperdimensional space and forming prototypes, it estimates uncertainty via prototype–feature similarity, with channel-wise (CPB) and patch-wise (PPB) mechanisms preserving per-channel and spatial uncertainty. Pre-fusion uncertainty weighting combined with these prototypes yields consistent improvements in 3D object detection and semantic segmentation under diverse and adversarial conditions, while dramatically reducing FLOPs and parameter counts compared to Bayesian baselines. The method demonstrates robustness to distribution shifts, offers practical efficiency for real-time AV systems, and is supported by comprehensive ablations, cost analyses, and supplemental theory on hyperdimensional expressivity and uncertainty quantifiability.

Abstract

Uncertainty Quantification (UQ) is crucial for ensuring the reliability of machine learning models deployed in real-world autonomous systems. However, existing approaches typically quantify task-level output prediction uncertainty without considering epistemic uncertainty at the multimodal feature fusion level, leading to sub-optimal outcomes. Additionally, popular uncertainty quantification methods, e.g., Bayesian approximations, remain challenging to deploy in practice due to high computational costs in training and inference. In this paper, we propose HyperDUM, a novel deterministic uncertainty method (DUM) that efficiently quantifies feature-level epistemic uncertainty by leveraging hyperdimensional computing. Our method captures the channel and spatial uncertainties through channel and patch -wise projection and bundling techniques respectively. Multimodal sensor features are then adaptively weighted to mitigate uncertainty propagation and improve feature fusion. Our evaluations show that HyperDUM on average outperforms the state-of-the-art (SOTA) algorithms by up to 2.01%/1.27% in 3D Object Detection and up to 1.29% improvement over baselines in semantic segmentation tasks under various types of uncertainties. Notably, HyperDUM requires 2.36x less Floating Point Operations and up to 38.30x less parameters than SOTA methods, providing an efficient solution for real-world autonomous systems.

Paper Structure

This paper contains 32 sections, 1 theorem, 26 equations, 6 figures, 15 tables.

Key Result

Corollary 1

A VSA can express an uncertainty similarity matrix $\mathcal{U} \in \mathbb{R}^{L\times N}$ if for any $\eta > 0$, there exists a $d \in \mathbb{N}$ and $d$-dimensional hypervectors $\mathcal{H}_{1}, \mathcal{H}_{2}, \cdots, \mathcal{H}_{N}$ such that $|\mathcal{U}_{i,j} - \delta(\mathcal{H}_{i}, \m

Figures (6)

  • Figure 1: Multimodal model with uncertainty quantification and uncertainty fusion for autonomous driving perception tasks.
  • Figure 2: Corner cases for the aiMotive 3D Object Detection dataset. (Top-Left) Normal Lidar point-cloud birds-eye-view. (Top-Right) Lidar Foggification (LF). (Bottom) In order from left to right (Normal, Motion Blur (MB), Over-Exposure (OE), Under-Exposure (UE))
  • Figure 3: Analysis on the effects of hyperdimension selection for channel projection on performance and computation (DeLiVER).
  • Figure 4: Analysis on the effects of patches and patch hyperdimension selection on performance and computation (DeLiVER).
  • Figure 5: Architectures diagram showing where we insert the uncertainty module for pre and post fusion methods. The uncertainty weighting is a single Conv layer, Input: (B,M,P,C)/(B,M,P,1), kernel=(P,1), stride=(P,1), Output: (B,M,1,C)/(B,M,1,1) for channel/patch weights respectively. B=Batch, M=Modality, P=Prototype, C=Channel. Channel/Patch weights are multiplied uniformly across all spatial/channel dimensions per channel/patch. The weighting module performs dimension matching automatically.
  • ...and 1 more figures

Theorems & Definitions (4)

  • Definition 1
  • Corollary 1
  • Definition 2
  • Definition 3