On the Number of Non-equivalent Parameterized Squares in a String

Rikuya Hamai; Kazushi Taketsugu; Yuto Nakashima; Shunsuke Inenaga; Hideo Bannai

On the Number of Non-equivalent Parameterized Squares in a String

Rikuya Hamai, Kazushi Taketsugu, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai

TL;DR

This work tackles the problem of bounding the number of non-equivalent parameterized squares in a string under parameterized equivalence. It introduces tighter parameterized periodicity lemmas and a refined counting strategy to prove that, for a string of length $n$ over an alphabet of size $σ$, the number of non-equivalent parameterized squares satisfies $PS(s) < σ n$. This significantly improves the previous upper bound of $2 σ! n$ and tightens our understanding of parameterized repetition structures in strings. The results rely on improved analyses of p-periods, substring alphabet sizes, and overlap behavior, with potential implications for broader pattern-matching and combinatorics-on-words problems in the parameterized setting.

Abstract

A string $s$ is called a parameterized square when $s = xy$ for strings $x$, $y$ and $x$ and $y$ are parameterized equivalent. Kociumaka et al. showed the number of parameterized squares, which are non-equivalent in parameterized equivalence, in a string of length $n$ that contains $σ$ distinct characters is at most $2 σ! n$ [TCS 2016]. In this paper, we show that the maximum number of non-equivalent parameterized squares is less than $σn$, which significantly improves the best-known upper bound by Kociumaka et al.

On the Number of Non-equivalent Parameterized Squares in a String

TL;DR

over an alphabet of size

, the number of non-equivalent parameterized squares satisfies

. This significantly improves the previous upper bound of

and tightens our understanding of parameterized repetition structures in strings. The results rely on improved analyses of p-periods, substring alphabet sizes, and overlap behavior, with potential implications for broader pattern-matching and combinatorics-on-words problems in the parameterized setting.

Abstract

A string

is called a parameterized square when

for strings

and

are parameterized equivalent. Kociumaka et al. showed the number of parameterized squares, which are non-equivalent in parameterized equivalence, in a string of length

that contains

distinct characters is at most

[TCS 2016]. In this paper, we show that the maximum number of non-equivalent parameterized squares is less than

, which significantly improves the best-known upper bound by Kociumaka et al.

Paper Structure (7 sections, 13 theorems, 10 equations, 2 figures)

This paper contains 7 sections, 13 theorems, 10 equations, 2 figures.

Introduction
Preliminaries
Strings
Parameterized squares
Parameterized periods
Upper bound of non-equivalent parameterized squares
Open questions

Key Result

Lemma 1

Let $s$ be a string that satisfies $p \parallel_{f} s$ and $q \parallel_{g} s$. If $p+q \leq |s|$ and $f \circ g = g \circ f$, then $\gcd(p, q) \parallel_{\!}s$.

Figures (2)

Figure 1: Illustration for the proof of Theorem \ref{['lem:prefix-squares']}: $\sigma+1$ non-equivalent parameterized squares cannot begin at the same position.
Figure 2: Illustration for Lemma \ref{['lem:lowerbound']}: There are $\sigma$ non-equivalent parameterized square prefixes.

Theorems & Definitions (21)

Lemma 1: Lemma 3 of Apostolico2008
Lemma 2: Lemma 5 of Ideguchi2023
Lemma 3: Lemma 4 of Ideguchi2023
Theorem 1
Theorem 2
Lemma 4
proof
Corollary 1
Lemma 5
proof
...and 11 more

On the Number of Non-equivalent Parameterized Squares in a String

TL;DR

Abstract

On the Number of Non-equivalent Parameterized Squares in a String

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (21)