EUGens: Efficient, Unified, and General Dense Layers

Sang Min Kim; Byeongchan Kim; Arijit Sehanobish; Somnath Basu Roy Chowdhury; Rahul Kidambi; Dongseok Shim; Avinava Dubey; Snigdha Chaturvedi; Min-hwan Oh; Krzysztof Choromanski

TL;DR

This work targets the dense-computation bottleneck in neural networks by introducing Efficient, Unified, and General dense layers (EUGens), which use random features and input-norm coupling to approximate standard fully-connected feedforward layers (FFLs) with linear-time inference. The authors prove unbiased approximation for polynomial activations and provide concentration and continuity results, while also offering Quasi-Monte Carlo variants to reduce variance. Empirically, replacing FFLs with EUGens in GPT-like Transformers, Vision Transformers, and neural radiance fields yields inference speedups of up to 27% and memory reductions of up to 30% across image, language, and 3D reconstruction tasks. EUGens also support layer-wise, backpropagation-free distillation, which allows post-training compression and efficient adaptation of pre-trained models while preserving expressive power, and makes large models easier to deploy in real-time systems.

Abstract

Efficient neural networks are essential for scaling machine learning models to real-time applications and resource-constrained environments. Fully-connected feedforward layers (FFLs) introduce computation and parameter count bottlenecks within neural network architectures. To address this challenge, we propose a new class of dense layers that generalize standard fully-connected feedforward layers: Efficient, Unified and General dense layers (EUGens). EUGens leverage random features to approximate standard FFLs and go beyond them by incorporating a direct dependence on the input norms in their computations. The proposed layers unify existing efficient FFL extensions and improve efficiency by reducing inference complexity from quadratic to linear time. They also lead to the first unbiased algorithms approximating FFLs with arbitrary polynomial activation functions. Furthermore, EUGens reduce the parameter count and computational overhead while preserving the expressive power and adaptability of FFLs. We also present a layer-wise knowledge transfer technique that bypasses backpropagation, enabling efficient adaptation of EUGens to pre-trained models. Empirically, we observe that integrating EUGens into Transformers and MLPs yields substantial improvements in inference speed (up to 27%) and memory efficiency (up to 30%) across a range of tasks, including image classification, language model pre-training, and 3D scene reconstruction. Overall, our results highlight the potential of EUGens for the scalable deployment of large-scale neural networks in real-world scenarios.
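To make the idea of an unbiased random-feature approximation of a polynomial-activation layer concrete, here is a minimal NumPy sketch. It is not the EUGen construction from the paper, which is not specified on this page and additionally depends on input norms. Instead it uses a generic, well-known estimator for the monomial activation f(z) = z^k: for independent standard Gaussian vectors g_1, ..., g_k, the product of (g_i · w)(g_i · x) over i has expectation (w · x)^k. The function names, the shape convention for `G`, and the choice of Gaussian projections are all assumptions made for illustration.

```python
import numpy as np


def random_feature_map(V, G):
    """Map each row of V (n, d) to m random features.

    G has shape (m, k, d): m features, each built from k independent
    Gaussian projections (k is the degree of the monomial activation).
    Feature j of a row v is prod_i <G[j, i], v> / sqrt(m).
    """
    m = G.shape[0]
    proj = np.einsum("mkd,nd->nmk", G, V)  # all projections, shape (n, m, k)
    return proj.prod(axis=-1) / np.sqrt(m)


def approx_poly_layer(X, W, G):
    """Unbiased estimate of (X @ W.T) ** k, with k = G.shape[1].

    X: (batch, d) inputs, W: (out, d) weights of the dense layer.
    Illustrative sketch only, not the paper's EUGen layer.
    """
    phi_x = random_feature_map(X, G)  # (batch, m), computed per input
    psi_w = random_feature_map(W, G)  # (out, m), can be precomputed once
    return phi_x @ psi_w.T            # (batch, out)
```

In this sketch the weight-side features `psi_w` depend only on `W` and `G`, so they can be computed once after training. For a fixed degree k, the per-input cost then scales with m(d + out) instead of d · out, and the estimator's variance shrinks as the number of random features m grows.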
