Table of Contents
Fetching ...

On Polynomial Approximations for Privacy-Preserving and Verifiable ReLU Networks

Ramy E. Ali, Jinhyun So, A. Salman Avestimehr

TL;DR

This work tackles privacy-preserving and verifiable inference for ReLU networks outsourced to untrusted clouds by replacing non-polynomial activations with polynomials. It analyzes the feasibility of approximating ReLU with degree-2 polynomials under integer-coefficient or finite-field constraints, showing that the classic ReLU cannot be uniformly approximated on $I=[-1,1]$ with integer coefficients, but a scaled ReLU with even scale can, motivating the surrogate $\sigma_{\mathrm{poly}}(x)=x^2+ax$. Through minimax and uniform-approximation theory, it derives $\sigma_{\mathrm{poly}}$ and demonstrates, across CIFAR-10/100 and Tiny-ImageNet with architectures like LiU’s CNN, NIN, LeNet, and VGG-16, that the polynomial activation yields substantial accuracy gains over the square function and close-to-ReLU performance, up to 10.4% in some settings. This establishes a practically effective pathway for secure, private, and verifiable inference using finite-field arithmetic in DNNs and points to future work on broader architectures and coefficient choices.

Abstract

Outsourcing deep neural networks (DNNs) inference tasks to an untrusted cloud raises data privacy and integrity concerns. While there are many techniques to ensure privacy and integrity for polynomial-based computations, DNNs involve non-polynomial computations. To address these challenges, several privacy-preserving and verifiable inference techniques have been proposed based on replacing the non-polynomial activation functions such as the rectified linear unit (ReLU) function with polynomial activation functions. Such techniques usually require polynomials with integer coefficients or polynomials over finite fields. Motivated by such requirements, several works proposed replacing the ReLU function with the square function. In this work, we empirically show that the square function is not the best degree-2 polynomial that can replace the ReLU function even when restricting the polynomials to have integer coefficients. We instead propose a degree-2 polynomial activation function with a first order term and empirically show that it can lead to much better models. Our experiments on the CIFAR and Tiny ImageNet datasets on various architectures such as VGG-16 show that our proposed function improves the test accuracy by up to 10.4% compared to the square function.

On Polynomial Approximations for Privacy-Preserving and Verifiable ReLU Networks

TL;DR

This work tackles privacy-preserving and verifiable inference for ReLU networks outsourced to untrusted clouds by replacing non-polynomial activations with polynomials. It analyzes the feasibility of approximating ReLU with degree-2 polynomials under integer-coefficient or finite-field constraints, showing that the classic ReLU cannot be uniformly approximated on with integer coefficients, but a scaled ReLU with even scale can, motivating the surrogate . Through minimax and uniform-approximation theory, it derives and demonstrates, across CIFAR-10/100 and Tiny-ImageNet with architectures like LiU’s CNN, NIN, LeNet, and VGG-16, that the polynomial activation yields substantial accuracy gains over the square function and close-to-ReLU performance, up to 10.4% in some settings. This establishes a practically effective pathway for secure, private, and verifiable inference using finite-field arithmetic in DNNs and points to future work on broader architectures and coefficient choices.

Abstract

Outsourcing deep neural networks (DNNs) inference tasks to an untrusted cloud raises data privacy and integrity concerns. While there are many techniques to ensure privacy and integrity for polynomial-based computations, DNNs involve non-polynomial computations. To address these challenges, several privacy-preserving and verifiable inference techniques have been proposed based on replacing the non-polynomial activation functions such as the rectified linear unit (ReLU) function with polynomial activation functions. Such techniques usually require polynomials with integer coefficients or polynomials over finite fields. Motivated by such requirements, several works proposed replacing the ReLU function with the square function. In this work, we empirically show that the square function is not the best degree-2 polynomial that can replace the ReLU function even when restricting the polynomials to have integer coefficients. We instead propose a degree-2 polynomial activation function with a first order term and empirically show that it can lead to much better models. Our experiments on the CIFAR and Tiny ImageNet datasets on various architectures such as VGG-16 show that our proposed function improves the test accuracy by up to 10.4% compared to the square function.

Paper Structure

This paper contains 13 sections, 4 theorems, 9 equations, 10 figures, 7 tables.

Key Result

Lemma 1

If the interval $I$ is of length four or more, then the only functions that can be uniformly approximated by polynomials with integer coefficient are those polynomials themselves.

Figures (10)

  • Figure 1: Our function $\sigma_{\mathrm{poly}}(x)=x^2+x$ is shown versus the scaled ReLU function $\sigma_{\mathrm{sr}}(x; 2)=2 \sigma_{\mathrm r}(x)$.
  • Figure 2: Accuracy of the CNN architecture in liu2017oblivious on the CIFAR-$10$ dataset.
  • Figure 3: Accuracy of the CNN architecture in liu2017oblivious on the CIFAR-$100$ dataset.
  • Figure 4: Accuracy of the NIN architecture in lin2013network on the CIFAR-$10$ dataset.
  • Figure 5: Accuracy of the NIN architecture in lin2013network on the CIFAR-$100$ dataset.
  • ...and 5 more figures

Theorems & Definitions (12)

  • Lemma 1
  • Lemma 2
  • Theorem 1
  • Lemma 3
  • Remark 1
  • Remark 2
  • Remark 3
  • Remark 4
  • Remark 5
  • Remark 6
  • ...and 2 more