Unbiased Single-Queried Gradient for Combinatorial Objective
Thanawat Sornwanee
TL;DR
This paper tackles optimizing combinatorial objectives defined on binary vectors by relaxing to a product-Bernoulli multilinear extension $v(x)$ and seeking gradients in the continuous space. It introduces Easy Stochastic Gradient (ESG), an unbiased single-queried gradient estimator that yields $\mathbb{E}[\nabla_x V(x;\omega,Q)]=\nabla_x v(x;Q)$ while using only one oracle call per realization, enabling gradient-based methods with autodiff. ESG relies on a carefully designed stochastic evaluation $V$ built from a good tuple $(f,\sigma,\hat{σ})$ and a per-coordinate factorization, unifying ties to REINFORCE through importance sampling and generating new estimators (e.g., Long Jump, Arch, Spike) with variance considerations. The paper further develops Calibration, Variance control, and a Stochastic Query Descent (SQD) framework, showing through symmetric-slice and encoded variants that one-query, unbiased, differentiable optimization is viable for large-scale combinatorial problems. The encoded ESG/EncodedSQD variants illustrate practical gains in computation and variance, highlighting the approach’s potential for scalable, black-box combinatorial optimization where oracle calls dominate cost.
Abstract
In a probabilistic reformulation of a combinatorial problem, we often face an optimization over a hypercube, which corresponds to the Bernoulli probability parameter for each binary variable in the primal problem. The combinatorial nature suggests that an exact gradient computation requires multiple queries. We propose a stochastic gradient that is unbiased and requires only a single query of the combinatorial function. This method encompasses a well-established REINFORCE (through an importance sampling), as well as including a class of new stochastic gradients.
