We've Got You Covered: Type-Guided Repair of Incomplete Input Generators

Patrick LaFontaine; Zhe Zhou; Ashish Mishra; Suresh Jagannathan; Benjamin Delaware

TL;DR

This work tackles the challenge of generating inputs that satisfy sparse preconditions in property-based testing by automatically repairing incomplete input generators. It introduces Cobb, a coverage-type guided enumerative synthesis pipeline that characterizes missing inputs with $[\nu{:}b \mid \phi]$ coverage types and patches generators using a bottom-up term enumeration driven by a cost model. The approach combines phase-based missing-coverage abduction, sketch localization with typed holes, and a lattice-based patch extraction to guarantee coverage completeness while preserving existing generator behavior. Empirical results across diverse datatypes (lists, trees, STLC terms) show Cobb can repair incomplete generators to achieve coverage-complete outputs, with varying performance depending on component sets and the chosen repair strategy, and it further demonstrates potential for sketch-based synthesis and static data generation in PBT workflows.

Abstract

Property-based testing (PBT) is a popular technique for automatically testing semantic properties of a program, specified as a pair of pre- and post-conditions. The efficacy of this approach depends on being able to quickly generate inputs that meet the precondition, in order to maximize the set of program behaviors that are probed. For semantically rich preconditions, purely random generation is unlikely to produce many valid inputs; when this occurs, users are forced to manually write their own specialized input generators. One common problem with handwritten generators is that they may be incomplete, i.e., they are unable to generate some values meeting the target precondition. This paper presents a novel program repair technique that patches an incomplete generator so that its range includes every valid input. Our approach uses a novel enumerative synthesis algorithm that leverages the recently developed notion of coverage types to characterize the set of missing test values as well as the coverage provided by candidate repairs. We have implemented a repair tool for OCaml generators, called Cobb, and used it to repair a suite of benchmarks drawn from the PBT literature.
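To make the notion of an incomplete generator concrete, the following is a minimal sketch, not Cobb's algorithm. It assumes a hypothetical precondition (the input list is sorted) and models a generator by the set of values it can produce within small bounds. All names (`is_sorted`, `valid_inputs`, `generator_range`) are illustrative and do not come from the paper, and Python is used here only for brevity, whereas the tool itself targets OCaml generators.

```python
from itertools import product


def is_sorted(xs):
    """Hypothetical precondition: elements are in non-decreasing order."""
    return all(a <= b for a, b in zip(xs, xs[1:]))


def valid_inputs(max_len, max_val):
    """Every tuple within the bounds that meets the precondition."""
    return {
        xs
        for n in range(max_len + 1)
        for xs in product(range(max_val + 1), repeat=n)
        if is_sorted(xs)
    }


def generator_range(max_len, max_val, lo=0, strict=True):
    """Range of a recursive sorted-list generator within the bounds.

    With strict=True, each element must exceed the previous one, so the
    generator can never emit a list with repeated elements (incomplete).
    With strict=False, equal neighbours are allowed (the repaired version).
    """
    out = {()}
    if max_len == 0:
        return out
    for x in range(lo, max_val + 1):
        next_lo = x + 1 if strict else x
        for rest in generator_range(max_len - 1, max_val, next_lo, strict):
            out.add((x,) + rest)
    return out


target = valid_inputs(2, 1)
incomplete = generator_range(2, 1, strict=True)
repaired = generator_range(2, 1, strict=False)

# Valid inputs the handwritten generator can never produce.
missing = target - incomplete
print(sorted(missing))  # [(0, 0), (1, 1)]
```

In this sketch the repaired range equals the full set of valid inputs and still contains everything the original generator produced, which mirrors the two goals stated above: cover every valid input while preserving existing behavior.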
