Context Normalization Layer with Applications
Bilal Faye, Mohamed-Djallel Dilmi, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra
TL;DR
The paper introduces Context Normalization (CN), a context-aware normalization method for image data that treats data as arising from a mixture of contexts and learns per-context statistics $\mu_r$ and $\sigma_r$ via a context embedder. CN extends Mixture Normalization by avoiding EM, enabling end-to-end differentiable learning, with CN-Patches and CN-Channels variants and an enhanced CN+ inference strategy that aggregates across contexts using posterior weights $\tau_r(x)$. Empirical results on CIFAR-10/100, ViT-based architectures, and domain adaptation scenarios (e.g., AdaMatch) show that CN accelerates convergence and yields superior accuracy over Batch Normalization and Mixture Normalization, often by large margins and across different learning rates and data distributions. The work demonstrates CN’s versatility across supervised, self-supervised, and domain-adaptive settings, highlighting its practical impact for robust, context-aware normalization in modern computer vision pipelines.
Abstract
Normalization is a pre-processing step that converts the data into a more usable representation. As part of the deep neural networks (DNNs), the batch normalization (BN) technique uses normalization to address the problem of internal covariate shift. It can be packaged as general modules, which have been extensively integrated into various DNNs, to stabilize and accelerate training, presumably leading to improved generalization. However, the effect of BN is dependent on the mini-batch size and it does not take into account any groups or clusters that may exist in the dataset when estimating population statistics. This study proposes a new normalization technique, called context normalization, for image data. This approach adjusts the scaling of features based on the characteristics of each sample, which improves the model's convergence speed and performance by adapting the data values to the context of the target task. The effectiveness of context normalization is demonstrated on various datasets, and its performance is compared to other standard normalization techniques.
