CF-GKAT: Efficient Validation of Control-Flow Transformations
Cheng Zhang, Tobias Kappé, David E. Narváez, Nico Naus
TL;DR
CF-GKAT extends Guarded Kleene Algebra with Tests to address non-local control flow and embedded primitive semantics by introducing indicator variables and a continuation-based semantics. It constructs CF-GKAT automata, lowers them to GKAT automata, and decides trace equivalence via GKAT bisimilarity, achieving near-linear performance in program size for fixed test variables. The approach enables reliable validation of control-flow transformations such as goto-elimination and decompiler outputs, demonstrated through a GNU Coreutils case study and a Ghidra-based decompilation pipeline. This work broadens the applicability of Kleene-algebra-based verification to real-world control-flow challenges, offering a practical verification tool for decompilation and control-flow restructuring tasks.
Abstract
Guarded Kleene Algebra with Tests (GKAT) provides a sound and complete framework to reason about trace equivalence between simple imperative programs. However, there are still several notable limitations. First, GKAT is completely agnostic with respect to the meaning of primitives, to keep equivalence decidable. Second, GKAT excludes non-local control flow such as goto, break, and return. To overcome these limitations, we introduce Control-Flow GKAT (CF-GKAT), a system that allows reasoning about programs that include non-local control flow as well as hardcoded values. CF-GKAT is able to soundly and completely verify trace equivalence of a larger class of programs, while preserving the nearly-linear efficiency of GKAT. This makes CF-GKAT suitable for the verification of control-flow manipulating procedures, such as decompilation and goto-elimination. To demonstrate CF-GKAT's abilities, we validated the output of several highly non-trivial program transformations, such as Erosa and Hendren's goto-elimination procedure and the output of Ghidra decompiler. CF-GKAT opens up the application of Kleene Algebra to a wider set of challenges, and provides an important verification tool that can be applied to the field of decompilation and control-flow transformation.
