Table of Contents
Fetching ...

Fast Dimensionality Reduction from $\ell_2$ to $\ell_p$

Rafael Chiclana, Mark Iwen

TL;DR

The paper provides a simple, fast linear embedding from $\ell_2$ to $\ell_p$ for all $p\in[1,2]$ that preserves Euclidean distances up to a factor $1+\varepsilon$ in the target $\ell_p$ norm, with a runtime of $\mathcal{O}(d \log k)$ when the target dimension satisfies $k \le d^{1/4}$. The construction uses a 4-wise independent matrix $A$, independent Rademacher diagonals, and Hadamard transforms in the form $\Psi x = k^{-1/p} \beta_p^{-1} A D_1 H D_2 H D_3 x$, enabling effective dimension reduction with controlled distortion and high probability guarantees. A key technical contribution is the combination of Talagrand concentration and a Berry-Esseen-type bound for the $\ell_p$ norm, together with a Hadamard-based flattening that ensures coordinate mass is sufficiently spread. The work also proves a general lower bound for embeddings into any target norm, showing $k$ must be at least $\Omega( \varepsilon^{-2} \log(\varepsilon^2 n)/\log(1/\varepsilon) )$, matching the Euclidean lower bound up to log factors. This advances fast, structure-friendly dimensionality reduction for $\ell_p$-based applications, including nearest neighbor search and robust learning tasks.

Abstract

The Johnson-Lindenstrauss (JL) lemma is a fundamental result in dimensionality reduction, ensuring that any finite set $X \subseteq \mathbb{R}^d$ can be embedded into a lower-dimensional space $\mathbb{R}^k$ while approximately preserving all pairwise Euclidean distances. In recent years, embeddings that preserve Euclidean distances when measured via the $\ell_1$ norm in the target space have received increasing attention due to their relevance in applications such as nearest neighbor search in high dimensions. A recent breakthrough by Dirksen, Mendelson, and Stollenwerk established an optimal $\ell_2 \to \ell_1$ embedding with computational complexity $O(d \log d)$. In this work, we generalize this direction and propose a simple linear embedding from $\ell_2$ to $\ell_p$ for any $p \in [1,2]$ based on a construction of Ailon and Liberty. Our method achieves a reduced runtime of $O(d \log k)$ when $k \leq d^{1/4}$, improving upon prior runtime results when the target dimension is small. Additionally, we show that for \emph{any norm} $\|\cdot\|$ in the target space, any embedding of $(\mathbb{R}^d, \|\cdot\|_2)$ into $(\mathbb{R}^k, \|\cdot\|)$ with distortion $\varepsilon$ generally requires $k = Ω\big(\varepsilon^{-2} \log(\varepsilon^2 n)/\log(1/\varepsilon)\big)$, matching the optimal bound for the $\ell_2$ case up to a logarithmic factor.

Fast Dimensionality Reduction from $\ell_2$ to $\ell_p$

TL;DR

The paper provides a simple, fast linear embedding from to for all that preserves Euclidean distances up to a factor in the target norm, with a runtime of when the target dimension satisfies . The construction uses a 4-wise independent matrix , independent Rademacher diagonals, and Hadamard transforms in the form , enabling effective dimension reduction with controlled distortion and high probability guarantees. A key technical contribution is the combination of Talagrand concentration and a Berry-Esseen-type bound for the norm, together with a Hadamard-based flattening that ensures coordinate mass is sufficiently spread. The work also proves a general lower bound for embeddings into any target norm, showing must be at least , matching the Euclidean lower bound up to log factors. This advances fast, structure-friendly dimensionality reduction for -based applications, including nearest neighbor search and robust learning tasks.

Abstract

The Johnson-Lindenstrauss (JL) lemma is a fundamental result in dimensionality reduction, ensuring that any finite set can be embedded into a lower-dimensional space while approximately preserving all pairwise Euclidean distances. In recent years, embeddings that preserve Euclidean distances when measured via the norm in the target space have received increasing attention due to their relevance in applications such as nearest neighbor search in high dimensions. A recent breakthrough by Dirksen, Mendelson, and Stollenwerk established an optimal embedding with computational complexity . In this work, we generalize this direction and propose a simple linear embedding from to for any based on a construction of Ailon and Liberty. Our method achieves a reduced runtime of when , improving upon prior runtime results when the target dimension is small. Additionally, we show that for \emph{any norm} in the target space, any embedding of into with distortion generally requires , matching the optimal bound for the case up to a logarithmic factor.

Paper Structure

This paper contains 10 sections, 11 theorems, 61 equations.

Key Result

Theorem 1.1

Let $\varepsilon, \rho \in (0, 1)$, $p \in [1,2]$, and $X \subseteq \mathbb{R}^d$ be a finite set with $|X| = n$. Suppose the embedding dimension $k$ satisfies where $C_0$ is the absolute constant in Lemma lem: berry-esseen. Then, with probability at least $1 - \rho$, the linear map $\Psi \colon \mathbb{R}^d \to \mathbb{R}^k$, defined in eq: def embedding, can be applied to any input vector in $\

Theorems & Definitions (20)

  • Theorem 1.1
  • Theorem 1.2
  • Theorem 1.3
  • Theorem 3.1: Talagrand's concentration inequality
  • Lemma 3.2: (5.2) and Lemma 5.1 in Ailon2008fast
  • Corollary 3.3
  • Lemma 3.4
  • proof
  • Remark 3.5
  • Lemma 3.6
  • ...and 10 more