AFN: Adaptive Fusion Normalization via an Encoder-Decoder Framework
Zikai Zhou, Shuo Zhang, Ziruo Wang, Huanran Chen
TL;DR
The paper addresses the stability-generalisation trade-off in normalization layers by proposing Adaptive Fusion Normalisation (AFN), a BN-based extension that learns both standardisation and rescaling statistics via an encoder–decoder, with residual gates to preserve BN behavior early in training. AFN blends batch statistics with network-derived statistics, aiming to reduce gradient instability seen with ASRNorm while improving domain generalisation across tasks such as Digits, CIFAR-10-C, and PACS, as well as image classification on SVHN, MNIST-M, and CIFAR-10/100. Empirical results show AFN consistently outperforms ASRNorm and BN baselines in several settings, with improved training stability and fewer gradient explosions. The work suggests AFN as a practical normalization strategy for cross-domain vision tasks and hints at broader applicability to other modalities like speech recognition, given its encoder–decoder framework for statistics learning.
Abstract
The success of deep learning is inseparable from normalization layers. Researchers have proposed various normalization functions, and each of them has both advantages and disadvantages. In response, efforts have been made to design a unified normalization function that combines all normalization procedures and mitigates their weaknesses. We also proposed a new normalization function called Adaptive Fusion Normalization. Through experiments, we demonstrate AFN outperforms the previous normalization techniques in domain generalization and image classification tasks.
