Faster Algorithm for Structured John Ellipsoid Computation
Yang Cao, Xiaoyu Li, Zhao Song, Xin Yang, Tianyi Zhou
TL;DR
The paper addresses the computational challenge of finding the John Ellipsoid inside a centrally symmetric polytope, formalized as $P = \{ x : -\mathbf{1}_n \le Ax \le \mathbf{1}_n \}$ with rank-$d$ matrix $A$. It introduces two fast approximation methods: a sketching-based algorithm that achieves near input-sparsity time with per-iteration cost $\widetilde{O}(\epsilon^{-1}\mathrm{nnz}(A) + \epsilon^{-2} d^{\omega})$ and total iterations $T = O(\epsilon^{-1}\log(n/d))$, and a treewidth-based algorithm that runs in $O(n\tau^2)$ per iteration when the dual graph of $A$ has treewidth $\tau$. Both approaches preserve containment relations $\frac{1}{\sqrt{1+\epsilon}} Q \subseteq P \subseteq \sqrt{d}\,Q$ for the computed ellipsoid $Q$, improving upon the previous $\widetilde{O}(n d^2)$ per-iteration cost. The methods rely on leveraging score approximations, sampling, and sketching in the input-sparsity setting, and on a $\tau$-sparse Cholesky-based decomposition in the small-treewidth setting, with a telescoping analysis to bound error accumulation. Together, these results yield practically faster algorithms for computing near-optimal John Ellipsoids, with implications for D-optimal design and related convex-optimization tasks in high dimensions.
Abstract
The famous theorem of Fritz John states that any convex body has a unique maximal volume inscribed ellipsoid, known as the John Ellipsoid. Computing the John Ellipsoid is a fundamental problem in convex optimization. In this paper, we focus on approximating the John Ellipsoid inscribed in a convex and centrally symmetric polytope defined by $ P := \{ x \in \mathbb{R}^d : -\mathbf{1}_n \leq A x \leq \mathbf{1}_n \},$ where $ A \in \mathbb{R}^{n \times d} $ is a rank-$d$ matrix and $ \mathbf{1}_n \in \mathbb{R}^n $ is the all-ones vector. We develop two efficient algorithms for approximating the John Ellipsoid. The first is a sketching-based algorithm that runs in nearly input-sparsity time $ \widetilde{O}(\mathrm{nnz}(A) + d^ω) $, where $ \mathrm{nnz}(A) $ denotes the number of nonzero entries in the matrix $A$ and $ ω\approx 2.37$ is the current matrix multiplication exponent. The second is a treewidth-based algorithm that runs in time $ \widetilde{O}(n τ^2)$, where $τ$ is the treewidth of the dual graph of the matrix $A$. Our algorithms significantly improve upon the state-of-the-art running time of $ \widetilde{O}(n d^2) $ achieved by [Cohen, Cousins, Lee, and Yang, COLT 2019].
