LFFR: Logistic Function For (single-output) Regression

John Chiang

LFFR: Logistic Function For (single-output) Regression

John Chiang

TL;DR

This work addresses privacy-preserving regression under fully homomorphic encryption by introducing LFFR, a logistic-function–based regression that extends linear and ridge regression with a Newton-like update. Central to the method is a Simplified Fixed Hessian that yields a diagonal, invertible bound allowing encrypted optimization without costly Hessian inversion, together with a polynomial sigmoid approximation for ciphertext-friendly computation. The paper also proposes an improved LFFR variant that normalizes targets and incorporates a gamma-controlled transformation to extend predictive range while maintaining HE efficiency. Experiments implemented with HEAAN on Boston and California Housing datasets demonstrate the feasibility of encrypted nonlinear regression and quantify the trade-offs between LFFR variants, normalization, and computational overhead. Overall, the approach provides a practical pathway to nonlinear, privacy-preserving regression in real-world encrypted deployments.

Abstract

Privacy-preserving regression in machine learning is a crucial area of research, aimed at enabling the use of powerful machine learning techniques while protecting individuals' privacy. In this paper, we implement privacy-preserving regression training using data encrypted under a fully homomorphic encryption scheme. We first examine the common linear regression algorithm and propose a (simplified) fixed Hessian for linear regression training, which can be applied for any datasets even not normalized into the range $[0, 1]$. We also generalize this constant Hessian matrix to the ridge regression version, namely linear regression which includes a regularization term to penalize large coefficients. However, our main contribution is to develop a novel and efficient algorithm called LFFR for homomorphic regression using the logistic function, which could model more complex relations between input values and output prediction in comparison with linear regression. We also find a constant simplified Hessian to train our LFFR algorithm using the Newton-like method and compare it against to with our new fixed Hessian linear regression training over two real-world datasets. We suggest normalizing not only the data but also the target predictions even for the original linear regression used in a privacy-preserving manner, which is helpful to remain weights in a small range, say $[-5, +5]$ good for refreshing ciphertext setting parameters, and avoid tuning the regularization parameter $λ$ via cross validation. The linear regression with normalized predictions could be a viable alternative to ridge regression.

LFFR: Logistic Function For (single-output) Regression

TL;DR

Abstract

. We also generalize this constant Hessian matrix to the ridge regression version, namely linear regression which includes a regularization term to penalize large coefficients. However, our main contribution is to develop a novel and efficient algorithm called LFFR for homomorphic regression using the logistic function, which could model more complex relations between input values and output prediction in comparison with linear regression. We also find a constant simplified Hessian to train our LFFR algorithm using the Newton-like method and compare it against to with our new fixed Hessian linear regression training over two real-world datasets. We suggest normalizing not only the data but also the target predictions even for the original linear regression used in a privacy-preserving manner, which is helpful to remain weights in a small range, say

good for refreshing ciphertext setting parameters, and avoid tuning the regularization parameter

via cross validation. The linear regression with normalized predictions could be a viable alternative to ridge regression.

Paper Structure (31 sections, 24 equations, 1 algorithm)

This paper contains 31 sections, 24 equations, 1 algorithm.

Introduction
Background
Related work
HE-Based Approaches.
MPC-Based Approaches.
Contributions
Preliminaries
Fully Homomorphic Encryption
Simplified Fixed Hessian
Linear Regression Model
Ridge Regression
Technical Details
Our Linear Regression Algorithm
Logistic Function for Regression
A Simple Assumption
...and 16 more sections

LFFR: Logistic Function For (single-output) Regression

TL;DR

Abstract

LFFR: Logistic Function For (single-output) Regression

Authors

TL;DR

Abstract

Table of Contents