Table of Contents
Fetching ...

Exponent-Strings and Their Edit Distance

Ingyu Baek

TL;DR

The paper introduces exponent-strings that extend traditional strings by allowing real-number exponents, enabling explicit representation of continuous attributes like duration in phonetic transcription. It formalizes $S$-exponent-strings, extends to $\mathbb{R}^+$-exponent-strings, and defines exp-edit distance as a continuous generalization of string edit distance, backed by a robust mathematical framework and a corresponding algorithm for the $\mathbb{Q}^+$ case. A central result is that a minimum-cost exp-edit sequence exists (Theorem inf-min), and that when restricted to $\mathbb{N}$-exponent-strings, exp-edit distance coincides with the classical string edit distance (Corollary), tying the new theory to the traditional discrete setting. The algorithmic development includes a contraction-based reduction, a translation to standard string edit distance, and a complexity analysis, with practical implications for tasks like phonetic similarity assessment and paraphasia detection where precise duration information matters.

Abstract

An exponent-string is an extension of traditional strings that can incorporate real-numbered exponents, indicating the quantity of characters. This novel representation overcomes the limitations of traditional discrete string by enabling precise data representation for applications such as phonetic transcription that contains sound duration. Although applications of exponent-string are focused on exponent-string with real-numbered exponents, formal definition uses arbitrary semigroup. For any semigroup $S$, $S$-exponent-strings are allowed to have elements of $S$ as exponents. We investigate algebraic properties of $S$-exponent-strings and further justify $\mathbb{R}^+$-exponent-string is a natural extension of the string. Motivated by the problem of calculating the similarity between spoken phone sequence and correct phone sequence, we develop exp-edit distance -- a specialized metric designed to measure the similarity between $\mathbb{R}^+$-exponent-strings. By extending the traditional string edit distance to handle continuous values, exp-edit distance deals with $\mathbb{R}^+$-exponent-strings that embody both discrete and continuous properties. Our exploration includes a rigorous mathematical formulation of exp-edit distance and an algorithm to compute it.

Exponent-Strings and Their Edit Distance

TL;DR

The paper introduces exponent-strings that extend traditional strings by allowing real-number exponents, enabling explicit representation of continuous attributes like duration in phonetic transcription. It formalizes -exponent-strings, extends to -exponent-strings, and defines exp-edit distance as a continuous generalization of string edit distance, backed by a robust mathematical framework and a corresponding algorithm for the case. A central result is that a minimum-cost exp-edit sequence exists (Theorem inf-min), and that when restricted to -exponent-strings, exp-edit distance coincides with the classical string edit distance (Corollary), tying the new theory to the traditional discrete setting. The algorithmic development includes a contraction-based reduction, a translation to standard string edit distance, and a complexity analysis, with practical implications for tasks like phonetic similarity assessment and paraphasia detection where precise duration information matters.

Abstract

An exponent-string is an extension of traditional strings that can incorporate real-numbered exponents, indicating the quantity of characters. This novel representation overcomes the limitations of traditional discrete string by enabling precise data representation for applications such as phonetic transcription that contains sound duration. Although applications of exponent-string are focused on exponent-string with real-numbered exponents, formal definition uses arbitrary semigroup. For any semigroup , -exponent-strings are allowed to have elements of as exponents. We investigate algebraic properties of -exponent-strings and further justify -exponent-string is a natural extension of the string. Motivated by the problem of calculating the similarity between spoken phone sequence and correct phone sequence, we develop exp-edit distance -- a specialized metric designed to measure the similarity between -exponent-strings. By extending the traditional string edit distance to handle continuous values, exp-edit distance deals with -exponent-strings that embody both discrete and continuous properties. Our exploration includes a rigorous mathematical formulation of exp-edit distance and an algorithm to compute it.
Paper Structure (12 sections, 18 theorems, 15 equations, 5 figures, 1 table, 2 algorithms)

This paper contains 12 sections, 18 theorems, 15 equations, 5 figures, 1 table, 2 algorithms.

Key Result

Proposition 1

For an arbitrary nonempty subset $X\subseteq \mathbb{R}$, if $X$ has a lower bound in real number, then $X$ also has the greatest lower bound or the infimum in real number.

Figures (5)

  • Figure 1: A pictorial illustration of four $\mathbb{R}^+$-exponent-strings; $\mathtt{ca^{0.5}t^{0.9}}$, $\mathtt{a^{1.4}t^{0.7}e^{0.8}}$, $\mathtt{a^1}$ and $\mathtt{ca^{2.5}k^{0.8}e^{0.7}}$, where the sound duration of each character is written as exponents. Note that $\mathtt{a}$'s in words ate and cake are long-vowels whereas $a$ in cat is a short-vowel.
  • Figure 2: An illustration that compares an exp-matching and string alignment. From the left, each picture depicts an exp-matching $E$ from $ab^{2.5}c$ to $cb^{1.5}d$, an alignment from $abbac$ to $abbda$ and finally, an alignment of $E$.
  • Figure 3: $a^{0.1}b^{1.9}$ and $a^{0.3}b^{1.7}$ are infixes corresponding to the line segment in the figure.
  • Figure 4: Original exp-matching from $p_1=a^{1.5}cb$ to $p_2=cb^{1.5}d$ is shown at the top. From left-upside to right-downside, each exp-matching after each step of the procedure is shown. The white circle is used to depict the boundary of multiple continuous line segments.
  • Figure 5: This can be valid exp-matching in $T_1$, if exp-matching in $T_0$ are covered by $B=\{b_{1,1},b_{1,2},b_{1,3},b_{3,3}\}$. $x_1,x_2, x_3$ and $x_4$ denote length of each line segment.

Theorems & Definitions (43)

  • Proposition 1: Jackson and Agunwamba JacksonA77
  • Definition 1
  • Definition 2
  • Definition 3: String edit distance
  • Definition 4: Alignment
  • Definition 5: $S$-exponent-string
  • Lemma 1
  • proof
  • Proposition 2
  • proof
  • ...and 33 more