Table of Contents
Fetching ...

A Pervasive, Efficient and Private Future: Realizing Privacy-Preserving Machine Learning Through Hybrid Homomorphic Encryption

Khoa Nguyen, Mindaugas Budzys, Eugene Frimpong, Tanveer Khan, Antonis Michalas

TL;DR

This work proposes Hybrid Homomorphic Encryption (HHE) as a scalable path to privacy-preserving ML, introducing two PPML protocols—3PervPPML and TrustedPervPPML—that push heavy HE work to the cloud and enable edge devices to participate securely. By leveraging a BFV-based HHE scheme with the PASTA construction, the authors demonstrate reduced ciphertext sizes and overall computational/communication overhead, including a TEE-enabled variant for zero-trust scenarios. They validate the approach with a real-world ECG classification application on MIT-BIH data, showing encrypted inference can approach plaintext accuracy while CSP shoulders most of the computation, and provide detailed performance benchmarks. The results indicate that HHE-based PPML can deliver practical, private ML services across pervasive devices, balancing security guarantees with efficiency and enabling broader deployment of privacy-preserving ML in edge-centric ecosystems.

Abstract

Machine Learning (ML) has become one of the most impactful fields of data science in recent years. However, a significant concern with ML is its privacy risks due to rising attacks against ML models. Privacy-Preserving Machine Learning (PPML) methods have been proposed to mitigate the privacy and security risks of ML models. A popular approach to achieving PPML uses Homomorphic Encryption (HE). However, the highly publicized inefficiencies of HE make it unsuitable for highly scalable scenarios with resource-constrained devices. Hence, Hybrid Homomorphic Encryption (HHE) -- a modern encryption scheme that combines symmetric cryptography with HE -- has recently been introduced to overcome these challenges. HHE potentially provides a foundation to build new efficient and privacy-preserving services that transfer expensive HE operations to the cloud. This work introduces HHE to the ML field by proposing resource-friendly PPML protocols for edge devices. More precisely, we utilize HHE as the primary building block of our PPML protocols. We assess the performance of our protocols by first extensively evaluating each party's communication and computational cost on a dummy dataset and show the efficiency of our protocols by comparing them with similar protocols implemented using plain BFV. Subsequently, we demonstrate the real-world applicability of our construction by building an actual PPML application that uses HHE as its foundation to classify heart disease based on sensitive ECG data.

A Pervasive, Efficient and Private Future: Realizing Privacy-Preserving Machine Learning Through Hybrid Homomorphic Encryption

TL;DR

This work proposes Hybrid Homomorphic Encryption (HHE) as a scalable path to privacy-preserving ML, introducing two PPML protocols—3PervPPML and TrustedPervPPML—that push heavy HE work to the cloud and enable edge devices to participate securely. By leveraging a BFV-based HHE scheme with the PASTA construction, the authors demonstrate reduced ciphertext sizes and overall computational/communication overhead, including a TEE-enabled variant for zero-trust scenarios. They validate the approach with a real-world ECG classification application on MIT-BIH data, showing encrypted inference can approach plaintext accuracy while CSP shoulders most of the computation, and provide detailed performance benchmarks. The results indicate that HHE-based PPML can deliver practical, private ML services across pervasive devices, balancing security guarantees with efficiency and enabling broader deployment of privacy-preserving ML in edge-centric ecosystems.

Abstract

Machine Learning (ML) has become one of the most impactful fields of data science in recent years. However, a significant concern with ML is its privacy risks due to rising attacks against ML models. Privacy-Preserving Machine Learning (PPML) methods have been proposed to mitigate the privacy and security risks of ML models. A popular approach to achieving PPML uses Homomorphic Encryption (HE). However, the highly publicized inefficiencies of HE make it unsuitable for highly scalable scenarios with resource-constrained devices. Hence, Hybrid Homomorphic Encryption (HHE) -- a modern encryption scheme that combines symmetric cryptography with HE -- has recently been introduced to overcome these challenges. HHE potentially provides a foundation to build new efficient and privacy-preserving services that transfer expensive HE operations to the cloud. This work introduces HHE to the ML field by proposing resource-friendly PPML protocols for edge devices. More precisely, we utilize HHE as the primary building block of our PPML protocols. We assess the performance of our protocols by first extensively evaluating each party's communication and computational cost on a dummy dataset and show the efficiency of our protocols by comparing them with similar protocols implemented using plain BFV. Subsequently, we demonstrate the real-world applicability of our construction by building an actual PPML application that uses HHE as its foundation to classify heart disease based on sensitive ECG data.
Paper Structure (17 sections, 1 theorem, 3 equations, 2 figures, 7 tables)

This paper contains 17 sections, 1 theorem, 3 equations, 2 figures, 7 tables.

Key Result

Proposition 1

Let $res$ be the inference result of a multi-layered ML model $f$, $\mathsf{HE}$ a semantically secure homomorphic encryption scheme and $\mathsf{PKE}$ an IND-CCA2 public key encryption scheme. We assume that $\mathcal{ADV}$ can corrupt the CSP in PervPPML, hence gaining access to $(c_w, c_b)$ and $

Figures (2)

  • Figure 1: $\mathsf{TrustedPervPPML}$
  • Figure 2: Quantizing ECG data into 4-bit integers. Top: Floating-point ECG data. Bottom: Corresponding quantized ECG data.

Theorems & Definitions (3)

  • Definition 1: Hybrid Homomorphic Encryption
  • Proposition 1: Data Reconstruction Attack Soundness
  • proof