Table of Contents
Fetching ...

Graph and String Parameters: Connections Between Pathwidth, Cutwidth and the Locality Number

Katrin Casel, Joel D. Day, Pamela Fleischmann, Tomasz Kociumaka, Florin Manea, Markus L. Schmid

TL;DR

This paper investigates the locality number $\operatorname{loc}(\alpha)$ of strings and its deep connections to two central graph layout parameters, cutwidth $\operatorname{cw}(G)$ and pathwidth $\operatorname{pw}(G)$. It establishes approximation-preserving reductions between these domains, proving that computing $\operatorname{loc}$ is NP-hard but fixed-parameter tractable when parameterised by the alphabet size $|\Sigma|$ or by $\operatorname{loc}(\alpha)$, and giving a $O(\sqrt{\log(\mathrm{opt})}\log n)$-approximation for MinLoc. The paper also shows how approximations for pathwidth translate into cutwidth approximations via new reductions, yielding concrete bounds such as $O(\sqrt{\log(\mathrm{opt})}\log h)$ and $O(\sqrt{\log(\mathrm{opt})}\mathrm{opt})$ for MinCutwidth on multigraphs, and introduces a second-order cutwidth $\operatorname{cw}_2$ to bridge cutwidth and pathwidth directly. Overall, these results fuse string combinatorics with classical graph parameters, enabling transfer of algorithmic techniques and closing several open questions about the locality parameter. The findings have practical relevance for pattern matching with variables and for leveraging graph-layout heuristics in string-processing problems.

Abstract

We investigate the locality number, a recently introduced structural parameter for strings (with applications in pattern matching with variables), and its connection to two important graph-parameters, cutwidth and pathwidth. These connections allow us to show that computing the locality number is NP-hard, but fixed-parameter tractable, if parameterised by the locality number or by the alphabet size, which has been formulated as open problems in the literature. Moreover, the locality number can be approximated with ratio O(sqrt(log(opt)) log(n)). An important aspect of our work -- that is relevant in its own right and of independent interest -- is that we identify connections between the string parameter of the locality number on the one hand, and the famous graph parameters of cutwidth and pathwidth, on the other hand. These two parameters have been jointly investigated in the literature and are arguably among the most central graph parameters that are based on "linearisations" of graphs. In this way, we also identify a direct approximation preserving reduction from cutwidth to pathwidth, which shows that any polynomial f(opt,|V|)-approximation algorithm for pathwidth yields a polynomial 2f(2 opt,h)-approximation algorithm for cutwidth on multigraphs (where h is the number of edges). In particular, this translates known approximation ratios for pathwidth into new approximation ratios for cutwidth, namely O(sqrt(log(opt)) log(h)) and O(sqrt(log(opt)) opt) for (multi) graphs with h edges.

Graph and String Parameters: Connections Between Pathwidth, Cutwidth and the Locality Number

TL;DR

This paper investigates the locality number of strings and its deep connections to two central graph layout parameters, cutwidth and pathwidth . It establishes approximation-preserving reductions between these domains, proving that computing is NP-hard but fixed-parameter tractable when parameterised by the alphabet size or by , and giving a -approximation for MinLoc. The paper also shows how approximations for pathwidth translate into cutwidth approximations via new reductions, yielding concrete bounds such as and for MinCutwidth on multigraphs, and introduces a second-order cutwidth to bridge cutwidth and pathwidth directly. Overall, these results fuse string combinatorics with classical graph parameters, enabling transfer of algorithmic techniques and closing several open questions about the locality parameter. The findings have practical relevance for pattern matching with variables and for leveraging graph-layout heuristics in string-processing problems.

Abstract

We investigate the locality number, a recently introduced structural parameter for strings (with applications in pattern matching with variables), and its connection to two important graph-parameters, cutwidth and pathwidth. These connections allow us to show that computing the locality number is NP-hard, but fixed-parameter tractable, if parameterised by the locality number or by the alphabet size, which has been formulated as open problems in the literature. Moreover, the locality number can be approximated with ratio O(sqrt(log(opt)) log(n)). An important aspect of our work -- that is relevant in its own right and of independent interest -- is that we identify connections between the string parameter of the locality number on the one hand, and the famous graph parameters of cutwidth and pathwidth, on the other hand. These two parameters have been jointly investigated in the literature and are arguably among the most central graph parameters that are based on "linearisations" of graphs. In this way, we also identify a direct approximation preserving reduction from cutwidth to pathwidth, which shows that any polynomial f(opt,|V|)-approximation algorithm for pathwidth yields a polynomial 2f(2 opt,h)-approximation algorithm for cutwidth on multigraphs (where h is the number of edges). In particular, this translates known approximation ratios for pathwidth into new approximation ratios for cutwidth, namely O(sqrt(log(opt)) log(h)) and O(sqrt(log(opt)) opt) for (multi) graphs with h edges.

Paper Structure

This paper contains 23 sections, 32 theorems, 14 equations, 8 figures.

Key Result

Lemma 3.1

Let $w=u^i$ be the $i$-times repetition of $u\in X^{\ast}$ and $i\in\mathbb{N}$. If $u$ is strictly $k$-local then $\mathop{\mathrm{\textsf{loc}}}\nolimits(w) \in \{ik-i+1, ik\}$.

Figures (8)

  • Figure 1: An illustration of the marking sequence $(\mathtt{b}, \mathtt{c}, \mathtt{e}, \mathtt{d}, \mathtt{a})$ with marking number of $4$ for the word $\beta = \mathtt{a} \mathtt{d} \mathtt{a} \mathtt{b} \mathtt{a} \mathtt{d} \mathtt{b} \mathtt{d} \mathtt{a} \mathtt{e} \mathtt{c} \mathtt{b} \mathtt{c} \mathtt{b}$ (left side), and the marking sequence $(\mathtt{d}, \mathtt{a}, \mathtt{b}, \mathtt{c}, \mathtt{e})$ with marking number of $3$ for the word $\beta$ (right side).
  • Figure 2: A graph $H$ and two possible linear arrangements with cuts of maximum size illustrated by vertical lines.
  • Figure 3: The path decomposition $(\{u, w, x\}, \{u, v, x\}, \{v, y, z\})$ for graph $H$ (see Figure \ref{['fig:exampleCutwidth']}) as a pd-marking scheme. White vertices are $\mathop{\mathrm{\texttt{open}}}\nolimits$, grey vertices are $\mathop{\mathrm{\texttt{active}}}\nolimits$, and black vertices are $\mathop{\mathrm{\texttt{closed}}}\nolimits$. In order to see that this is a pd-marking scheme, it is sufficient to observe that for every edge there is a step in the pd-marking scheme where both incident vertices are grey (i. e., $\mathop{\mathrm{\texttt{active}}}\nolimits$).
  • Figure 4: The graph $H_{\alpha, k}$ for $\alpha = \mathtt{a} \mathtt{b} \mathtt{c} \mathtt{b} \mathtt{c} \mathtt{d} \mathtt{b} \mathtt{a} \mathtt{d} \mathtt{a}$ and $k = 2$; an optimal linear arrangement of $H_{\alpha,k}$ with cutwidth $4$ induces the optimal marking sequence $(\mathtt{c}, \mathtt{b}, \mathtt{d}, \mathtt{a})$ for $\alpha$ with marking number $2$.
  • Figure 5: A graph $H$ and its multigraph $H'$ obtained by doubling the edges; the edge labels describe a Eulerian cycle that starts and ends in $x$. Deleting the edge $(v, x)$ in this cycle yields the word $\alpha_{(v, x)} = xwuxwuxvuvyzvyzv$, which has an optimal marking sequence $(w, u, x, v, y, z)$ with marking number $3$, and, thus, induces an optimal linear arrangement of $H$ with cutwidth $3$.
  • ...and 3 more figures

Theorems & Definitions (59)

  • Lemma 3.1
  • Lemma 3.2
  • Lemma 3.3
  • Lemma 4.1
  • proof
  • Lemma 4.2
  • proof
  • Lemma 4.3
  • proof
  • Lemma 4.4
  • ...and 49 more