Credibility-Aware Multi-Modal Fusion Using Probabilistic Circuits
Sahil Sidheekh, Pranuthi Tenali, Saurabh Mathur, Erik Blasch, Kristian Kersting, Sriraam Natarajan
TL;DR
The paper tackles credibility-aware late fusion for noisy, multi-modal data by modeling the joint distribution of unimodal predictions and the target with Probabilistic Circuits (PCs). It defines a principled credibility measure based on divergence and conditional entropy, and introduces two fusion variants: Direct-PC (DPC) and Credibility-Weighted Mean (CWM). The authors establish that PCs enable tractable inference for predictive and credibility queries and demonstrate competitive performance across multiple datasets (AV-MNIST, CUB, NYUD, SUNRGBD) while providing reliable modality credibility estimates. The work offers a principled, robust, and scalable approach to multi-modal fusion with explicit uncertainty and source reliability considerations, with potential impact on safety-critical applications.
Abstract
We consider the problem of late multi-modal fusion for discriminative learning. Motivated by noisy, multi-source domains that require understanding the reliability of each data source, we explore the notion of credibility in the context of multi-modal fusion. We propose a combination function that uses probabilistic circuits (PCs) to combine predictive distributions over individual modalities. We also define a probabilistic measure to evaluate the credibility of each modality via inference queries over the PC. Our experimental evaluation demonstrates that our fusion method can reliably infer credibility while maintaining competitive performance with the state-of-the-art.
