Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, Luc Van Gool
TL;DR
The paper presents a unified end-to-end framework for learning compressible representations by softly relaxing quantization and entropy and then annealing to hard quantization. It introduces soft-to-hard vector quantization with differentiable surrogates for quantization and entropy, using a histogram-based, nonparametric entropy estimate and a continuation-based annealing schedule. The method is demonstrated on image compression via a compressive autoencoder and on DNN compression using a ResNet, achieving competitive results with simpler training and fewer distributional assumptions. Key contributions include a differentiable soft quantizer, a soft entropy upper bound, and an end-to-end optimization of both network parameters and quantization centers to minimize the rate-distortion objective. This framework broadens the scope of end-to-end learned compression to jointly handle feature and parameter quantization across diverse tasks.
Abstract
We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy. Our method is based on a soft (continuous) relaxation of quantization and entropy, which we anneal to their discrete counterparts throughout training. We showcase this method for two challenging applications: Image compression and neural network compression. While these tasks have typically been approached with different methods, our soft-to-hard quantization approach gives results competitive with the state-of-the-art for both.
