Table of Contents
Fetching ...

Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk

Yuzhou Gu, Nikki Lijing Kuang, Yi-An Ma, Zhao Song, Lichen Zhang

TL;DR

A robust sampling framework that computes spectral approximations to the Hessian of the barrier functions in each iteration is proposed and a generalized soft-threshold Dikin walk beyond log-barrier is designed.

Abstract

We consider the problem of sampling from a $d$-dimensional log-concave distribution $π(θ) \propto \exp(-f(θ))$ for $L$-Lipschitz $f$, constrained to a convex body with an efficiently computable self-concordant barrier function, contained in a ball of radius $R$ with a $w$-warm start. We propose a \emph{robust} sampling framework that computes spectral approximations to the Hessian of the barrier functions in each iteration. We prove that for polytopes that are described by $n$ hyperplanes, sampling with the Lee-Sidford barrier function mixes within $\widetilde O((d^2+dL^2R^2)\log(w/δ))$ steps with a per step cost of $\widetilde O(nd^{ω-1})$, where $ω\approx 2.37$ is the fast matrix multiplication exponent. Compared to the prior work of Mangoubi and Vishnoi, our approach gives faster mixing time as we are able to design a generalized soft-threshold Dikin walk beyond log-barrier. We further extend our result to show how to sample from a $d$-dimensional spectrahedron, the constrained set of a semidefinite program, specified by the set $\{x\in \mathbb{R}^d: \sum_{i=1}^d x_i A_i \succeq C \}$ where $A_1,\ldots,A_d, C$ are $n\times n$ real symmetric matrices. We design a walk that mixes in $\widetilde O((nd+dL^2R^2)\log(w/δ))$ steps with a per iteration cost of $\widetilde O(n^ω+n^2d^{3ω-5})$. We improve the mixing time bound of prior best Dikin walk due to Narayanan and Rakhlin that mixes in $\widetilde O((n^2d^3+n^2dL^2R^2)\log(w/δ))$ steps.

Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk

TL;DR

A robust sampling framework that computes spectral approximations to the Hessian of the barrier functions in each iteration is proposed and a generalized soft-threshold Dikin walk beyond log-barrier is designed.

Abstract

We consider the problem of sampling from a -dimensional log-concave distribution for -Lipschitz , constrained to a convex body with an efficiently computable self-concordant barrier function, contained in a ball of radius with a -warm start. We propose a \emph{robust} sampling framework that computes spectral approximations to the Hessian of the barrier functions in each iteration. We prove that for polytopes that are described by hyperplanes, sampling with the Lee-Sidford barrier function mixes within steps with a per step cost of , where is the fast matrix multiplication exponent. Compared to the prior work of Mangoubi and Vishnoi, our approach gives faster mixing time as we are able to design a generalized soft-threshold Dikin walk beyond log-barrier. We further extend our result to show how to sample from a -dimensional spectrahedron, the constrained set of a semidefinite program, specified by the set where are real symmetric matrices. We design a walk that mixes in steps with a per iteration cost of . We improve the mixing time bound of prior best Dikin walk due to Narayanan and Rakhlin that mixes in steps.
Paper Structure (67 sections, 67 theorems, 303 equations, 2 tables, 3 algorithms)

This paper contains 67 sections, 67 theorems, 303 equations, 2 tables, 3 algorithms.

Key Result

Theorem 1.1

Let $\delta\in (0,1)$ and $R\geq 1$. Given a constraint matrix $A\in \mathbb{R}^{n\times d}$ with a vector $b\in \mathbb{R}^n$, let $\mathcal{K}:=\{x\in \mathbb{R}^d: Ax\leq b \}$ be the corresponding polytope. Suppose $\mathcal{K}$ is enclosed in a ball of radius $R$ with non-empty interior. Let $f for $\epsilon=\Theta(1/d)$. Then, Algorithm alg:informal takes at most Markov chain steps. It uses

Theorems & Definitions (153)

  • Theorem 1.1: Robust sampling for log-concave distribution over polytopes
  • Theorem 1.2: Robust sampling with nearly-universal barrier
  • Theorem 1.3: Robust sampling for log-concave distribution over spectrahedra
  • Definition A.1: Fast matrix multiplication
  • Definition A.2: Fast matrix multiplication, an alternative notation
  • Lemma A.3: lgu18
  • Definition A.4: Dikin ellipsoid
  • Definition A.5: Cross-ratio distance
  • Lemma A.6: lv03
  • Lemma A.7: Lemma 1 on page 1325 of lm00
  • ...and 143 more