Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification
Haojian Huang, Chuanyu Qin, Zhe Liu, Kaijing Ma, Jin Chen, Han Fang, Chao Ban, Hao Sun, Zhongjiang He
TL;DR
This work tackles multi-view classification under domain gaps and conflicting views by introducing TUNED, a framework that fuses local feature-neighborhood structures within each view with global neighborhood consensus. A selective Markov random field (S-MRF) adaptively weighs cross-view evidence, while a shared parameterized evidence extractor learns Dirichlet-distributed evidence conditioned on local structures. Neighborhood-aware deep learning integrates local and global information to produce robust uncertainty estimates through Dirichlet parameters $\boldsymbol{\alpha}^v$ and corresponding beliefs $\boldsymbol{b}^v$ and uncertainty $u^v$, with losses that balance evidence quality and cross-view consistency. Empirical results on eight benchmarks show improved accuracy and resilience to conflicting views, outperforming DST-based fusion and other state-of-the-art evidential MVC methods; the approach advances reliable multi-view fusion in uncertain, heterogeneous data settings.
Abstract
Multi-view classification (MVC) faces inherent challenges due to domain gaps and inconsistencies across different views, often resulting in uncertainties during the fusion process. While Evidential Deep Learning (EDL) has been effective in addressing view uncertainty, existing methods predominantly rely on the Dempster-Shafer combination rule, which is sensitive to conflicting evidence and often neglects the critical role of neighborhood structures within multi-view data. To address these limitations, we propose a Trusted Unified Feature-NEighborhood Dynamics (TUNED) model for robust MVC. This method effectively integrates local and global feature-neighborhood (F-N) structures for robust decision-making. Specifically, we begin by extracting local F-N structures within each view. To further mitigate potential uncertainties and conflicts in multi-view fusion, we employ a selective Markov random field that adaptively manages cross-view neighborhood dependencies. Additionally, we employ a shared parameterized evidence extractor that learns global consensus conditioned on local F-N structures, thereby enhancing the global integration of multi-view features. Experiments on benchmark datasets show that our method improves accuracy and robustness over existing approaches, particularly in scenarios with high uncertainty and conflicting views. The code will be made available at https://github.com/JethroJames/TUNED.
