(Near)-Optimal Algorithms for Sparse Separable Convex Integer Programs
Christoph Hunkenschröder, Martin Koutecký, Asaf Levin, Tung Anh Vu
TL;DR
The paper tackles the problem of optimizing a separable convex function over the integer points in a polyhedron defined by $A\mathbf{x}=\mathbf{b}$ and $\mathbf{l} \leq \mathbf{x} \leq \mathbf{u}$, focusing on block-structured matrices with small coefficients and low primal or dual treedepth. It develops two near-linear-time algorithms: a primal algorithm with time $g(td_P(A), \|A\|_\infty) \; n \log\max(\|\mathbf{u}-\mathbf{l}\|_\infty, \|\mathbf{b}\|_\infty)$ that matches the information-theoretic lower bound up to parameter dependencies, and a dual algorithm with time $g(td_D(A), \|A\|_\infty) \; n \log n \log\max(\|\mathbf{u}-\mathbf{l}\|_\infty, \|\mathbf{b}\|_\infty)$, conjectured to be optimal up to log factors. The approach blends scaling, proximity, and sensitivity analyses with a new convolution-tree dynamic data structure to enable sparse, fast updates, and it applies to $n$-fold, 2-stage, multi-stage, and tree-fold matrices. This work fills a gap between linear objective results and general separable convex objectives, offering efficient algorithms for a broad class of block-structured IPs with practical implications for optimization and statistical learning.
Abstract
We study the general integer programming (IP) problem of optimizing a separable convex function over the integer points of a polytope: $\min \{f(\mathbf{x}) \mid A\mathbf{x} = \mathbf{b}, \, \mathbf{l} \leq \mathbf{x} \leq \mathbf{u}, \, \mathbf{x} \in \mathbb{Z}^n\}$. The number of variables $n$ is a variable part of the input, and we consider the regime where the constraint matrix $A$ has small coefficients $\|A\|_\infty$ and small primal or dual treedepth $\mathrm{td}_P(A)$ or $\mathrm{td}_D(A)$, respectively. Equivalently, we consider block-structured matrices, in particular $n$-fold, tree-fold, $2$-stage and multi-stage matrices. We ask about the possibility of near-linear time algorithms in the general case of (non-linear) separable convex functions. The techniques of previous works for the linear case are inherently limited to it; in fact, no strongly-polynomial algorithm may exist due to a simple unconditional information-theoretic lower bound of $n \log \|\mathbf{u}-\mathbf{l}\|_\infty$, where $\mathbf{l}, \mathbf{u}$ are the vectors of lower and upper bounds. Our first result is that with parameters $\mathrm{td}_P(A)$ and $\|A\|_\infty$, this lower bound can be matched (up to dependency on the parameters). Second, with parameters $\mathrm{td}_D(A)$ and $\|A\|_\infty$, the situation is more involved, and we design an algorithm with time complexity $g(\mathrm{td}_D(A), \|A\|_\infty) n \log n \log \|\mathbf{u}-\mathbf{l}\|_\infty$ where $g$ is some computable function. We conjecture that a stronger lower bound is possible in this regime, and our algorithm is in fact optimal. Our algorithms combine ideas from scaling, proximity, and sensitivity of integer programs, together with a new dynamic data structure.
