Automated Discovery of Improved Constant Weight Binary Codes

Christopher D. Rosin

Automated Discovery of Improved Constant Weight Binary Codes

Christopher D. Rosin

TL;DR

Improved lower bounds on A(n,d,w) are established by constructing new larger codes, for 24 values of $(n,d,w) with $6 \leq d \leq 18$ and $18 \leq n \leq 35$.

Abstract

A constant weight binary code consists of $n$-bit binary codewords, each with exactly $w$ bits equal to 1, such that any two codewords are at least Hamming distance $d$ apart. $A(n,d,w)$ is the maximum size of a constant weight binary code with parameters $n,d,w$. We establish improved lower bounds on $A(n,d,w)$ by constructing new larger codes, for 24 values of $(n,d,w)$ with $6 \leq d \leq 18$ and $18 \leq n \leq 35$. The improved lower bounds come from two strategies. The first is a tabu search that operates at the level of bit swaps. The second is a novel greedy heuristic that repeatedly chooses the candidate codeword that maximizes a randomly-scored histogram of distances to previously-added codewords. These strategies were proposed by CPro1, an automated protocol that generates, implements, and tests diverse strategies for combinatorial constructions.

Automated Discovery of Improved Constant Weight Binary Codes

TL;DR

Improved lower bounds on A(n,d,w) are established by constructing new larger codes, for 24 values of

6 \leq d \leq 18

18 \leq n \leq 35$.

Abstract

A constant weight binary code consists of

-bit binary codewords, each with exactly

bits equal to 1, such that any two codewords are at least Hamming distance

apart.

is the maximum size of a constant weight binary code with parameters

. We establish improved lower bounds on

by constructing new larger codes, for 24 values of

with

and

. The improved lower bounds come from two strategies. The first is a tabu search that operates at the level of bit swaps. The second is a novel greedy heuristic that repeatedly chooses the candidate codeword that maximizes a randomly-scored histogram of distances to previously-added codewords. These strategies were proposed by CPro1, an automated protocol that generates, implements, and tests diverse strategies for combinatorial constructions.

Paper Structure (8 sections, 8 figures, 3 tables)

This paper contains 8 sections, 8 figures, 3 tables.

Introduction
Main Results
Prior Work
Method Development
Automated Generation of Strategies with CPro1
Random-Score Distance Histograms (RSDH)
Sliced Random-Score Distance Histograms
Conclusion

Figures (8)

Figure 1: The problem definition used with CPro1.
Figure 2: CPro1's prompt for listing 20 candidate strategies. This is appended to the problem definition in Fig. \ref{['fig:problemdef']}. Half the runs here included the bold sentence of the prompt requesting strategies with a novel element; the other half use the default configuration of CPro1 which omits this extra sentence.
Figure 3: Bit-Swap Tabu Search. In the efficient implementation, Conflict-Pairs and Penalty are maintained incrementally. Note HD($x,y$) is the Hamming Distance between binary words $x$ and $y$. We set $t_{min}=5$, $t_{max}=15$, max_no_improve_restart=1M.
Figure 4: Random-Score Distance Histograms (RSDH). $n,w,d$ are the parameters of the code, and $s$ is the target size (number of codewords). Note HD($x,y$) is the Hamming Distance between binary words $x$ and $y$.
Figure 5: Sliced Random-Score Distance Histograms (S-RSDH). $b$ is the number of bits that define the slice; we use $b \in \{1,2,3\}$. $t$ is the number of trials for the remainder of the code; we fix $t=1000$. Greedy-Build, Rand-Vec, and HD are defined in Fig. \ref{['fig:rsdhpseudocode']}.
...and 3 more figures

Automated Discovery of Improved Constant Weight Binary Codes

TL;DR

Abstract

Automated Discovery of Improved Constant Weight Binary Codes

Authors

TL;DR

Abstract

Table of Contents

Figures (8)