Table of Contents
Fetching ...

Improving ML Attacks on LWE with Data Repetition and Stepwise Regression

Alberto Alfarano, Eshika Saxena, Emily Wenger, François Charton, Kristin Lauter

Abstract

The Learning with Errors (LWE) problem is a hard math problem in lattice-based cryptography. In the simplest case of binary secrets, it is the subset sum problem, with error. Effective ML attacks on LWE were demonstrated in the case of binary, ternary, and small secrets, succeeding on fairly sparse secrets. The ML attacks recover secrets with up to 3 active bits in the "cruel region" (Nolte et al., 2024) on samples pre-processed with BKZ. We show that using larger training sets and repeated examples enables recovery of denser secrets. Empirically, we observe a power-law relationship between model-based attempts to recover the secrets, dataset size, and repeated examples. We introduce a stepwise regression technique to recover the "cool bits" of the secret.

Improving ML Attacks on LWE with Data Repetition and Stepwise Regression

Abstract

The Learning with Errors (LWE) problem is a hard math problem in lattice-based cryptography. In the simplest case of binary secrets, it is the subset sum problem, with error. Effective ML attacks on LWE were demonstrated in the case of binary, ternary, and small secrets, succeeding on fairly sparse secrets. The ML attacks recover secrets with up to 3 active bits in the "cruel region" (Nolte et al., 2024) on samples pre-processed with BKZ. We show that using larger training sets and repeated examples enables recovery of denser secrets. Empirically, we observe a power-law relationship between model-based attempts to recover the secrets, dataset size, and repeated examples. We introduce a stepwise regression technique to recover the "cool bits" of the secret.

Paper Structure

This paper contains 15 sections, 6 equations, 2 figures, 31 tables, 1 algorithm.

Figures (2)

  • Figure 1: Model parameters $N$ vs model based attempts $A$ for three secrets with different Hamming weights $h$.
  • Figure 2: Total training data $D$ and repetition $R$ vs model based attempts $A$ for one secret with Hamming weight $h=70$. Total distinct data can be computed as $D/R$.