Revisit the Partial Coloring Method: Prefix Spencer and Sampling

Dongrun Cai; Xue Chen; Wenxuan Shu; Haoyu Wang; Guangyi Zou

Revisit the Partial Coloring Method: Prefix Spencer and Sampling

Dongrun Cai, Xue Chen, Wenxuan Shu, Haoyu Wang, Guangyi Zou

TL;DR

This paper advances discrepancy theory by dissecting the partial coloring method through two complementary algorithmic lenses: a linear-algebraic framework and a Gaussian measure framework. It proves that, for $A\in\{0,1\}^{m\times n}$ with $n\ge m$, one can obtain a prefix coloring with prefix discrepancy $O(\sqrt{m})$ while fixing $\Omega(n)$ entries, and recursively achieve a full coloring with prefix discrepancy $O(\sqrt{m}\cdot\log\frac{O(n)}{m})$; it also shows a Gaussian-measure-based approach yielding the same prefix bound with $\Omega(n)$ fixed entries and a full coloring within the same order up to a logarithmic factor, underpinned by small-deviation bounds for Gaussian processes. Beyond the Spencer setting, the authors extend the linear-algebraic framework to a sampling procedure with min-entropy $\Omega(n)$ via leverage scores, enabling diverse colorings and applying to Beck-Fiala and vector balancing. They further adapt these ideas to prefix discrepancy, online discrepancy, and matrix discrepancy contexts, highlighting practical paths to robust, randomized colorings with strong discrepancy guarantees. The work thus unifies and extends algorithmic methods in discrepancy theory, providing near-optimal prefix-discrepancy results and practical sampling techniques with provable entropy properties. This yields more versatile tools for discrepancy minimization and applications like Beck-Fiala, prefix discrepancy, and matrix sparsification, with implications for randomized design and online settings.

Abstract

As the most powerful tool in discrepancy theory, the partial coloring method has wide applications in many problems including the Beck-Fiala problem and Spencer's celebrated result. Currently, there are two major algorithmic methods for the partial coloring method: the first approach uses linear algebraic tools; and the second is called Gaussian measure algorithm. We explore the advantages of these two methods and show the following results for them separately. 1. Spencer conjectured that the prefix discrepancy of any $\mathbf{A} \in \{0,1\}^{m \times n}$ is $O(\sqrt{m})$. We show how to find a partial coloring with prefix discrepancy $O(\sqrt{m})$ and $Ω(n)$ entries in $\{ \pm 1\}$ efficiently. To the best of our knowledge, this provides the first partial coloring whose prefix discrepancy is almost optimal. However, unlike the classical discrepancy problem, there is no reduction on the number of variables $n$ for the prefix problem. By recursively applying partial coloring, we obtain a full coloring with prefix discrepancy $O(\sqrt{m} \cdot \log \frac{O(n)}{m})$. Prior to this work, the best bounds of the prefix Spencer conjecture for arbitrarily large $n$ were $2m$ and $O(\sqrt{m \log n})$. 2. Our second result extends the first linear algebraic approach to a sampling algorithm in Spencer's classical setting. On the first hand, Spencer proved that there are $1.99^m$ good colorings with discrepancy $O(\sqrt{m})$. Hence a natural question is to design efficient random sampling algorithms in Spencer's setting. On the other hand, some applications of discrepancy theory, prefer a random solution instead of a fixed one. Our second result is an efficient sampling algorithm whose random output has min-entropy $Ω(n)$ and discrepancy $O(\sqrt{m})$. Moreover, our technique extends the linear algebraic framework by incorporating leverage scores of randomized matrix algorithms.

Revisit the Partial Coloring Method: Prefix Spencer and Sampling

TL;DR

with

, one can obtain a prefix coloring with prefix discrepancy

while fixing

entries, and recursively achieve a full coloring with prefix discrepancy

; it also shows a Gaussian-measure-based approach yielding the same prefix bound with

fixed entries and a full coloring within the same order up to a logarithmic factor, underpinned by small-deviation bounds for Gaussian processes. Beyond the Spencer setting, the authors extend the linear-algebraic framework to a sampling procedure with min-entropy

via leverage scores, enabling diverse colorings and applying to Beck-Fiala and vector balancing. They further adapt these ideas to prefix discrepancy, online discrepancy, and matrix discrepancy contexts, highlighting practical paths to robust, randomized colorings with strong discrepancy guarantees. The work thus unifies and extends algorithmic methods in discrepancy theory, providing near-optimal prefix-discrepancy results and practical sampling techniques with provable entropy properties. This yields more versatile tools for discrepancy minimization and applications like Beck-Fiala, prefix discrepancy, and matrix sparsification, with implications for randomized design and online settings.

Abstract

. We show how to find a partial coloring with prefix discrepancy

and

entries in

efficiently. To the best of our knowledge, this provides the first partial coloring whose prefix discrepancy is almost optimal. However, unlike the classical discrepancy problem, there is no reduction on the number of variables

for the prefix problem. By recursively applying partial coloring, we obtain a full coloring with prefix discrepancy

. Prior to this work, the best bounds of the prefix Spencer conjecture for arbitrarily large

were

and

. 2. Our second result extends the first linear algebraic approach to a sampling algorithm in Spencer's classical setting. On the first hand, Spencer proved that there are

good colorings with discrepancy

. Hence a natural question is to design efficient random sampling algorithms in Spencer's setting. On the other hand, some applications of discrepancy theory, prefer a random solution instead of a fixed one. Our second result is an efficient sampling algorithm whose random output has min-entropy

and discrepancy

. Moreover, our technique extends the linear algebraic framework by incorporating leverage scores of randomized matrix algorithms.

Paper Structure (40 sections, 28 theorems, 115 equations, 7 algorithms)

This paper contains 40 sections, 28 theorems, 115 equations, 7 algorithms.

Introduction
Our Results
Related Works
Algorithms in the linear-algebraic framework.
Prefix Discrepancy Theory.
The Gaussian measure algorithm.
Online Discrepancy Theory.
Discussion
Organization.
Preliminaries
Discrepancy theory.
Prefix discrepancy.
Overview
Prefix Discrepancy Theory.
Sampling.
...and 25 more sections

Key Result

Theorem 1.1

Given any $\mathbf{A} \in \{0,1\}^{m \times n}$ with $n \ge m$, the Gaussian measure algorithm finds a partial coloring $\mathbf{x} \in [-1,1]^n$ such that Moreover, there exists an efficient algorithm to find a full coloring $\mathbf{x} \in\{\pm 1\}^n$ whose prefix discrepancy is $O(\sqrt{m} \cdot \log \frac{O(n)}{m})$.

Theorems & Definitions (43)

Theorem 1.1
Theorem 1.2
Theorem 2.1
Lemma 3.1: Lévy's inequality
Claim 3.2
Theorem 4.1
Corollary 4.2
Lemma 4.3
Lemma 4.4
Lemma 4.5
...and 33 more

Revisit the Partial Coloring Method: Prefix Spencer and Sampling

TL;DR

Abstract

Revisit the Partial Coloring Method: Prefix Spencer and Sampling

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (43)