Unbiased Single-Queried Gradient for Combinatorial Objective

Thanawat Sornwanee

Unbiased Single-Queried Gradient for Combinatorial Objective

Thanawat Sornwanee

TL;DR

This paper tackles optimizing combinatorial objectives defined on binary vectors by relaxing to a product-Bernoulli multilinear extension $v(x)$ and seeking gradients in the continuous space. It introduces Easy Stochastic Gradient (ESG), an unbiased single-queried gradient estimator that yields $\mathbb{E}[\nabla_x V(x;\omega,Q)]=\nabla_x v(x;Q)$ while using only one oracle call per realization, enabling gradient-based methods with autodiff. ESG relies on a carefully designed stochastic evaluation $V$ built from a good tuple $(f,\sigma,\hat{σ})$ and a per-coordinate factorization, unifying ties to REINFORCE through importance sampling and generating new estimators (e.g., Long Jump, Arch, Spike) with variance considerations. The paper further develops Calibration, Variance control, and a Stochastic Query Descent (SQD) framework, showing through symmetric-slice and encoded variants that one-query, unbiased, differentiable optimization is viable for large-scale combinatorial problems. The encoded ESG/EncodedSQD variants illustrate practical gains in computation and variance, highlighting the approach’s potential for scalable, black-box combinatorial optimization where oracle calls dominate cost.

Abstract

In a probabilistic reformulation of a combinatorial problem, we often face an optimization over a hypercube, which corresponds to the Bernoulli probability parameter for each binary variable in the primal problem. The combinatorial nature suggests that an exact gradient computation requires multiple queries. We propose a stochastic gradient that is unbiased and requires only a single query of the combinatorial function. This method encompasses a well-established REINFORCE (through an importance sampling), as well as including a class of new stochastic gradients.

Unbiased Single-Queried Gradient for Combinatorial Objective

TL;DR

This paper tackles optimizing combinatorial objectives defined on binary vectors by relaxing to a product-Bernoulli multilinear extension

and seeking gradients in the continuous space. It introduces Easy Stochastic Gradient (ESG), an unbiased single-queried gradient estimator that yields

while using only one oracle call per realization, enabling gradient-based methods with autodiff. ESG relies on a carefully designed stochastic evaluation

built from a good tuple

and a per-coordinate factorization, unifying ties to REINFORCE through importance sampling and generating new estimators (e.g., Long Jump, Arch, Spike) with variance considerations. The paper further develops Calibration, Variance control, and a Stochastic Query Descent (SQD) framework, showing through symmetric-slice and encoded variants that one-query, unbiased, differentiable optimization is viable for large-scale combinatorial problems. The encoded ESG/EncodedSQD variants illustrate practical gains in computation and variance, highlighting the approach’s potential for scalable, black-box combinatorial optimization where oracle calls dominate cost.

Abstract

Paper Structure (31 sections, 18 theorems, 102 equations, 6 figures, 4 algorithms)

This paper contains 31 sections, 18 theorems, 102 equations, 6 figures, 4 algorithms.

Introduction
Why the obvious approach fails.
Related Work
Score-function estimators and variance reduction
Continuous relaxations and straight-through gradients
Zeroth-order / black-box gradient methods
Stochastic Gradient
Easy Stochastic Gradient
Examples
Calibration
Easy Stochastic Gradient is not Calibrated
Variance, Calibration, & Importance Sampling
Stochastic Query Descent
Experiment: Symmetric-Slice Optimization
Preliminary Result
...and 16 more sections

Key Result

Proposition 3.3

With respect to $\left( \Omega, \mathcal{F}, \mathbb{P} \right)$, $V$ is a stochastic evaluation function w with $f_v$ and $K$ as in the definition def:seval if and only if, for all $q \in \mathbb{R}$, $x \in (0,1)^d$, and $y \in \{0,1\}^d$,

Figures (6)

Figure 1: Computational graph for our ESG algorithm \ref{['alg:ESG']}: The yellow arrows represent operations that allow backpropagation of gradient, while the red one, which requires either rounding or querying, does not. The stochastic value $v$ itself can be taken gradient with respect to $x$, while yielding an unbiased gradient estimator.
Figure 2: Median of the best-so-far progress of different algorithms with inter-quartiles uncertainty. The algorithm we proposed are in bold lines, and the pre-existing algorithms are in dashed lines. This result comes from the case when the dimension $d= 30$.
Figure 3: Result for the experiment for symmetric-slice optimization from \ref{['subsection:symmetricslice']}: Median of the best-so-far progress of different algorithms with inter-quartiles uncertainty. The algorithm we proposed are in bold line, and the pre-existing algorithms are in a dashed line. The experiment comes from $20$ trials. Note that the time-scale for the 3 plots (with 3 different dimensions) are different.
Figure 4: Result for the experiment for symmetric-slice optimization from \ref{['subsection:symmetricslice']}: The bold lines are our SQD algorithms, while the dashed lines are our Encoded SQD algorithms. The encoding function $\hat{\sigma}^{-1}\left( \cdot \right)$ is linear in the case of "SQD: LongJump", so the two methods, "SQD: LongJump" and 'EncodedSQD: LongJump" have roughly the same performance.
Figure 5: Result for the experiment for modified Knapsack problem: Median of the best-so-far progress of different algorithms with inter-quartiles uncertainty. The algorithm we proposed are in bold line, and the pre-existing algorithms are in a dashed line. The experiment comes from $20$ trials. Note that the time-scale for the 3 plots (with 3 different dimensions) are different.
...and 1 more figures

Theorems & Definitions (53)

Definition 3.1
Definition 3.2
Proposition 3.3
Definition 3.4
Corollary 3.5
proof
Remark 3.6: Stochastic Evaluation by Stochastic Key
Remark 3.7: REINFROCE Gradient
Definition 3.8
Theorem 3.9
...and 43 more

Unbiased Single-Queried Gradient for Combinatorial Objective

TL;DR

Abstract

Unbiased Single-Queried Gradient for Combinatorial Objective

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (53)