Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations
Ido Ben-Yair, Gil Ben Shalom, Moshe Eliasof, Eran Treister
TL;DR
This work investigates quantized neural networks through a partial differential equation (PDE) lens, introducing total-variation (TV) based edge-aware smoothing and forward-stable architectures to mitigate quantization noise. By treating activation error as diffusion-like noise and enforcing stability via symmetric dynamics, the authors design TV-augmented networks and stable variants of ResNet, MobileNetV2, and PDE-GCNs. Empirical results across image classification and semi-supervised graph tasks show that stable, quantized models can achieve comparable accuracy to full-precision networks while using fewer parameters, with TV layers improving quantization fidelity. The findings suggest that PDE-inspired stability enhances reliability and efficiency for edge and real-time deployments, including autonomous driving scenarios.
Abstract
Quantization of Convolutional Neural Networks (CNNs) is a common approach to ease the computational burden involved in the deployment of CNNs, especially on low-resource edge devices. However, fixed-point arithmetic is not natural to the type of computations involved in neural networks. In this work, we explore ways to improve quantized CNNs using PDE-based perspective and analysis. First, we harness the total variation (TV) approach to apply edge-aware smoothing to the feature maps throughout the network. This aims to reduce outliers in the distribution of values and promote piece-wise constant maps, which are more suitable for quantization. Secondly, we consider symmetric and stable variants of common CNNs for image classification, and Graph Convolutional Networks (GCNs) for graph node-classification. We demonstrate through several experiments that the property of forward stability preserves the action of a network under different quantization rates. As a result, stable quantized networks behave similarly to their non-quantized counterparts even though they rely on fewer parameters. We also find that at times, stability even aids in improving accuracy. These properties are of particular interest for sensitive, resource-constrained, low-power or real-time applications like autonomous driving.
