Neural Network Training on Encrypted Data with TFHE
Luis Montero, Jordan Frery, Celia Kherfallah, Roman Bredehoft, Andrei Stoian
TL;DR
The paper tackles confidential training of neural networks on encrypted data, addressing leakage risks in multi-party settings where data may be horizontally or vertically distributed. It introduces a unified training pipeline built on Fully Homomorphic Encryption using TFHE, with PBS-enabled arithmetic and a quantized integer representation to support logistic regression and a one-hidden-layer MLP. Key contributions include offline quantization calibration to produce an integer graph with LUT-based activations, a TFHE-based compilation into secure circuits, a bit-removal rounding operator for scaling with $M = 2^{-n_{r}} M_0$, and a mini-batch scheme that refreshes ciphertexts. Experimental results show convergence to plaintext fp32 accuracy on two datasets and competitive latency metrics, demonstrating practical leakage-free training.
Abstract
We present an approach to outsourcing of training neural networks while preserving data confidentiality from malicious parties. We use fully homomorphic encryption to build a unified training approach that works on encrypted data and learns quantized neural network models. The data can be horizontally or vertically split between multiple parties, enabling collaboration on confidential data. We train logistic regression and multi-layer perceptrons on several datasets.
