Neural Prior Estimation: Learning Class Priors from Latent Representations
Masoud Yavari, Payman Moallem
TL;DR
This work tackles class-imbalance bias in deep recognition by learning explicit class priors directly from latent representations. It introduces the Neural Prior Estimator (NPE), comprising one or more Prior Estimation Modules (PEMs) trained with a one-way logistic loss to produce a feature-conditioned prior signal, which is then used in logit adjustment (NPE-LA) at inference via $\\tilde{\\mathbf{z}}(\\mathbf{x}) = \\mathbf{z}(\\mathbf{x}) - \\boldsymbol{\\eta}(\\mathbf{x})$. The authors provide theoretical justification under the Neural Collapse regime showing that PEMs estimate a monotone transformation of the empirical counts, i.e., the log-prior up to an additive constant, and demonstrate strong empirical gains on long-tailed CIFAR and segmentation benchmarks (STARE, ADE20K), with careful handling of scaling and BN-free design for stable dense predictions. The approach is lightweight, inference-efficient, and compatible with existing augmentation and representation-learning pipelines, offering a principled path to online, adaptive imbalance correction without requiring explicit priors or re-sampling. Overall, NPE-LA delivers robust, feature-aware bias mitigation with clear theoretical and practical benefits for both instance-level and dense prediction tasks.
Abstract
Class imbalance induces systematic bias in deep neural networks by imposing a skewed effective class prior. This work introduces the Neural Prior Estimator (NPE), a framework that learns feature-conditioned log-prior estimates from latent representations. NPE employs one or more Prior Estimation Modules trained jointly with the backbone via a one-way logistic loss. Under the Neural Collapse regime, NPE is analytically shown to recover the class log-prior up to an additive constant, providing a theoretically grounded adaptive signal without requiring explicit class counts or distribution-specific hyperparameters. The learned estimate is incorporated into logit adjustment, forming NPE-LA, a principled mechanism for bias-aware prediction. Experiments on long-tailed CIFAR and imbalanced semantic segmentation benchmarks (STARE, ADE20K) demonstrate consistent improvements, particularly for underrepresented classes. NPE thus offers a lightweight and theoretically justified approach to learned prior estimation and imbalance-aware prediction.
