Designed Dithering Sign Activation for Binary Neural Networks
Brayan Monroy, Juan Estupiñan, Tatiana Gelvez-Barrera, Jorge Bacca, Henry Arguello
TL;DR
This work tackles information loss in Binary Neural Networks caused by binarizing activations. It introduces DeSign, a designed dithering Sign activation using a spatially periodic threshold kernel that is optimized to preserve structural information while maintaining binary computations. The threshold kernel design combines a brute-force selection based on total variation and an entry-scaling step to BN distributions, with 2D and 3D variants (DeSign3D) to exploit spatial and channel correlations. Empirical evaluations on CIFAR-10/100 and STL-10 show DeSign improves accuracy over standard Sign-based BNNs and reduces sensitivity to learned batch normalization parameters, achieving up to 4.51% gains without additional computational cost. The method offers a practical path to higher-performing, energy-efficient BNNs and can extend to other activations and end-to-end design strategies.
Abstract
Binary Neural Networks emerged as a cost-effective and energy-efficient solution for computer vision tasks by binarizing either network weights or activations. However, common binary activations, such as the Sign activation function, abruptly binarize the values with a single threshold, losing fine-grained details in the feature outputs. This work proposes an activation that applies multiple thresholds following dithering principles, shifting the Sign activation function for each pixel according to a spatially periodic threshold kernel. Unlike literature methods, the shifting is defined jointly for a set of adjacent pixels, taking advantage of spatial correlations. Experiments over the classification task demonstrate the effectiveness of the designed dithering Sign activation function as an alternative activation for binary neural networks, without increasing the computational cost. Further, DeSign balances the preservation of details with the efficiency of binary operations.
