Many Flavors of Edit Distance

Sudatta Bhattacharya; Sanjana Dey; Elazar Goldenberg; Michal Koucký

TL;DR

This paper shows how to reduce string-similarity questions over arbitrary alphabets to equivalent questions over a binary alphabet, and how to transform questions about indel distance into equivalent questions about edit distance.

Abstract

Several measures of string similarity exist, notably the edit distance and the indel distance. The former counts the minimum number of insertions, deletions, and substitutions required to transform one string into another, while the latter counts only insertions and deletions. Many algorithmic solutions explicitly address one of these measures, and techniques applicable to one can frequently be adapted to the other. In this paper, we investigate whether there is a standardized approach for transferring results from one setting to the other. Specifically, we show how to reduce questions about string similarity over arbitrary alphabets to equivalent questions over a binary alphabet. Furthermore, we show how to transform questions about indel distance into equivalent questions about edit distance. This complements an earlier result of Tiskin (2007), which addresses the inverse direction.
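
To make the two measures in the abstract concrete, the following is a minimal sketch of the textbook dynamic program for each. It is not the paper's reduction. The function names `edit_distance`, `indel_distance`, and the helper `_distance` are illustrative assumptions and do not come from the paper.

```python
def _distance(x: str, y: str, allow_substitution: bool) -> int:
    """Textbook O(|x|*|y|) dynamic program (hypothetical helper, not from the paper)."""
    # prev[j] holds the distance between the current prefix of x and y[:j].
    prev = list(range(len(y) + 1))
    for i, a in enumerate(x, 1):
        cur = [i]  # turning x[:i] into the empty string takes i deletions
        for j, b in enumerate(y, 1):
            if a == b:
                best = prev[j - 1]  # matching characters cost nothing
            else:
                # delete a from x, or insert b into x
                best = min(prev[j], cur[j - 1]) + 1
                if allow_substitution:
                    # substitute a by b (allowed for edit distance only)
                    best = min(best, prev[j - 1] + 1)
            cur.append(best)
        prev = cur
    return prev[-1]


def edit_distance(x: str, y: str) -> int:
    """Minimum number of insertions, deletions, and substitutions."""
    return _distance(x, y, allow_substitution=True)


def indel_distance(x: str, y: str) -> int:
    """Minimum number of insertions and deletions only."""
    return _distance(x, y, allow_substitution=False)


print(edit_distance("kitten", "sitting"))   # 3
print(indel_distance("kitten", "sitting"))  # 5
```

Since a substitution can always be simulated by one deletion plus one insertion, the indel distance is at least the edit distance and at most twice the edit distance. The example pair above has values 3 and 5, which fall within that range.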
