Latent label distribution grid representation for modeling uncertainty
ShuNing Sun, YinSong Xiong, Yu Zhang, Zhuoran Zheng
TL;DR
LDL can capture polysemy but annotation noise induces uncertainty in the label space, harming learning. The paper presents Latent Label Distribution Grid (LLDG), which builds a label correlation grid from label differences and expands entries into Gaussian components to form a latent 3D grid, then uses Tucker-based low-rank regularization to suppress noise; a local-global feature extractor produces the grid and an LLDG-Mixer reconstructs the final label distribution with a combined loss. Key contributions include integrating correlation-based representations with probabilistic expansion, introducing Tucker-based noise reduction, and demonstrating the approach on 15 LDL datasets plus a MedMNIST transfer, with strong robustness to noise. The work offers a principled, scalable framework for uncertainty-aware LDL and extends naturally to downstream classification tasks. Overall, LLDG provides improved stability and performance by explicitly modeling label-space uncertainty through a latent grid structure.
Abstract
Although \textbf{L}abel \textbf{D}istribution \textbf{L}earning (LDL) has promising representation capabilities for characterizing the polysemy of an instance, the complexity and high cost of the label distribution annotation lead to inexact in the construction of the label space. The existence of a large number of inexact labels generates a label space with uncertainty, which misleads the LDL algorithm to yield incorrect decisions. To alleviate this problem, we model the uncertainty of label distributions by constructing a \textbf{L}atent \textbf{L}abel \textbf{D}istribution \textbf{G}rid (LLDG) to form a low-noise representation space. Specifically, we first construct a label correlation matrix based on the differences between labels, and then expand each value of the matrix into a vector that obeys a Gaussian distribution, thus building a LLDG to model the uncertainty of the label space. Finally, the LLDG is reconstructed by the LLDG-Mixer to generate an accurate label distribution. Note that we enforce a customized low-rank scheme on this grid, which assumes that the label relations may be noisy and it needs to perform noise-reduction with the help of a Tucker reconstruction technique. Furthermore, we attempt to evaluate the effectiveness of the LLDG by considering its generation as an upstream task to achieve the classification of the objects. Extensive experimental results show that our approach performs competitively on several benchmarks.
