Optimal Program Synthesis via Abstract Interpretation

Stephen Mell; Steve Zdancewic; Osbert Bastani

Optimal Program Synthesis via Abstract Interpretation

Stephen Mell, Steve Zdancewic, Osbert Bastani

TL;DR

The paper addresses optimal synthesis of neurosymbolic programs with real-valued constants by introducing a unified framework that couples A* search with an admissible heuristic derived from abstract interpretation. It expands the search space to generalized partial programs and uses interval domains to overapproximate both program semantics and the objective, enabling provable pruning of suboptimal branches. The authors instantiate the framework on two DSLs (NEAR and Quivr) and demonstrate superior scalability compared to SMT-based optima and BFS baselines, while maintaining optimality guarantees within a user-defined tolerance $\bepsilon$. This approach offers a principled, scalable path for synthesizing high-quality programs for trajectory labeling and related data-processing tasks. The work advances the state of neurosymbolic synthesis by providing a general, provably optimal framework that leverages abstract interpretation to guide search.

Abstract

We consider the problem of synthesizing programs with numerical constants that optimize a quantitative objective, such as accuracy, over a set of input-output examples. We propose a general framework for optimal synthesis of such programs in a given domain specific language (DSL), with provable optimality guarantees. Our framework enumerates programs in a general search graph, where nodes represent subsets of concrete programs. To improve scalability, it uses A* search in conjunction with a search heuristic based on abstract interpretation; intuitively, this heuristic establishes upper bounds on the value of subtrees in the search graph, enabling the synthesizer to identify and prune subtrees that are provably suboptimal. In addition, we propose a natural strategy for constructing abstract transformers for monotonic semantics, which is a common property for components in DSLs for data classification. Finally, we implement our approach in the context of two such existing DSLs, demonstrating that our algorithm is more scalable than existing optimal synthesizers.

Optimal Program Synthesis via Abstract Interpretation

TL;DR

. This approach offers a principled, scalable path for synthesizing high-quality programs for trajectory labeling and related data-processing tasks. The work advances the state of neurosymbolic synthesis by providing a general, provably optimal framework that leverages abstract interpretation to guide search.

Abstract

Paper Structure (21 sections, 2 theorems, 55 equations, 2 figures, 1 table, 1 algorithm)

This paper contains 21 sections, 2 theorems, 55 equations, 2 figures, 1 table, 1 algorithm.

Introduction
Motivating Example
Optimal Synthesis via Abstract Interpretation
Problem Formulation
$A^*$ Synthesis via Abstract Interpretation
Instantiation for Interval Domains
Interval Domains from Partial Orders
Interval Transformers for Monotone Functions
Interval Transfomers for Monotone Objectives
Partial Programs with Interval Constraints
Interval Transformers for Partial Programs with Interval Constraints
Implementation
NEAR DSL for Trajectory Labeling
Quivr DSL For Trajectory Queries
Abstract $F_1$ Score
...and 6 more sections

Key Result

theorem 1

If Algorithm alg:main returns an abstract program $\hat{p}$, then $\hat{p}$ is $\epsilon$-optimal---i.e. $\forall p \in \gamma(\hat{p})$, $\phi^*_Z - \phi(p,Z) \leq \epsilon$.

Figures (2)

Figure 1: A frame from a video of two mice interacting calms21; the mice are very close together, and are exhibiting the "sniff" behavior. The video has been processed using deep neural networks to produce certain keypoints, which are shown.
Figure 2: The time (seconds, log scale) to identify the optimal program and prove its optimality, for our approach (blue, solid) and an SMT solver (red, dashed), as a function of the size of the training dataset, for two different tasks.

Theorems & Definitions (9)

definition 1
definition 2
theorem 1
definition 3
definition 4
definition 5
lemma 1
definition 6
definition 7

Optimal Program Synthesis via Abstract Interpretation

TL;DR

Abstract

Optimal Program Synthesis via Abstract Interpretation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (9)