Density of group languages in shift spaces
Valérie Berthé, Herman Goulet-Ouellet, Carl-Fredrik Nyberg-Brodda, Dominique Perrin, Karl Petersen
TL;DR
The article develops a unified ergodic-theoretic framework to study the density of group languages in shift spaces. By translating densities into ergodic sums on skew products G ⋊_φ X, it proves existence of δ_μ(L) for all group languages and derives computable formulas, with a particularly clean version δ_μ(L) = |K|/|G| when ν×μ is ergodic on the skew product. The work then specializes to shifts of finite type and minimal shifts, introducing φ-irreducibility and modular cobounding maps to characterize ergodicity and minimality; it provides a density formula in terms of minimal cobounding maps and explores how return words, bifix codes, and coboundaries control the structure of minimal skew products. The results yield concrete equidistribution statements for classical systems (Fibonacci, Sturmian, dendric shifts) and show that, under natural irreducibility hypotheses, densities are often universal across group languages. These insights connect automata-theoretic language density with ergodic theory, offering practical tools for computing pattern frequencies in symbolic dynamical systems.
Abstract
The density of a rational language can be understood as the frequency of some "pattern" in the shift space, for example a pattern like "words with an even number of a given letter." We study the density of group languages, i.e. rational languages recognized by morphisms onto finite groups, inside shift spaces. We show that the density with respect to any given ergodic measure on a shift space exists for every group language, because it can be computed by using any ergodic lift of the given measure to some skew product between the shift space and the recognizing group. We then further study densities in shifts of finite type (with a suitable notion of irreducibility), and then in minimal shifts. In the latter case, we obtain a closed formula for the density under the condition that the skew product has minimal closed invariant subsets which are ergodic under the product of the original measure and the uniform probability measure on the group. The formula is derived in part from a characterization of minimal closed invariant subsets for skew products relying on notions of cocycles and coboundaries. In the case where the whole skew product is ergodic under the product measure, then the density is just the cardinality of the subset of the group which defines the language divided by the cardinality of the group. Moreover, we provide sufficient conditions for the skew product to have minimal closed invariant subsets that are ergodic under the product measure. Finally, we investigate the link between minimal closed invariant subsets, return words and bifix codes.
