VeriPlan: Integrating Formal Verification and LLMs into End-User Planning

Christine Lee; David Porfirio; Xinyu Jessica Wang; Kevin Zhao; Bilge Mutlu

TL;DR

VeriPlan tackles the challenge of deploying LLM-driven end-user planning by introducing a formal verification layer that uses model checking to enforce user-defined constraints on LLM outputs. The system couples a rule translator, flexibility sliders, and an external model checker with an iterative LLM planning loop to provide deterministic boundaries and transparent feedback, while preserving user control and creativity. A within-subject user study (n=12) shows that VeriPlan improves perceived output quality, usefulness, satisfaction, and efficiency, with the model checker enabling clearer guidance and faster plan convergence. The work offers practical design implications for integrating verification and multi-dimensional user control into LLM systems, and demonstrates how external verification can substantially increase reliability and user trust in everyday planning tasks.

Abstract

Automated planning is traditionally the domain of experts, utilized in fields like manufacturing and healthcare with the aid of expert planning tools. Recent advances in LLMs have made planning more accessible to everyday users, since LLMs can assist with complex planning tasks. However, LLMs face several application challenges within end-user planning, including consistency, accuracy, and user trust issues. This paper introduces VeriPlan, a system that applies formal verification techniques, specifically model checking, to enhance the reliability and flexibility of LLMs for end-user planning. Alongside the LLM planner, VeriPlan includes three core features (a rule translator, flexibility sliders, and a model checker) that engage users in the verification process. Through a user study (n=12), we evaluate VeriPlan, demonstrating improvements in the perceived quality, usability, and user satisfaction of LLMs. Our work shows the effective integration of formal verification and user-control features with LLMs for end-user planning tasks.
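To make the verify-and-replan loop described above concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not VeriPlan's implementation: the names `Step`, `Rule`, `check`, and `plan_with_verification` are hypothetical, the LLM planner is replaced by a hard-coded stub, the boolean `strict` flag is a crude stand-in for a flexibility slider, and simple predicate checks stand in for a real model checker, which would explore a formal model of the plan.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Step:
    name: str
    start: int  # hour of day, 0-23
    end: int


Plan = List[Step]


@dataclass(frozen=True)
class Rule:
    # Hypothetical stand-in for a translated user rule.
    description: str
    holds: Callable[[Plan], bool]
    strict: bool = True  # assumption: a two-position "flexibility slider"


def check(plan: Plan, rules: List[Rule]) -> List[str]:
    """Return descriptions of the strict rules that the plan violates."""
    return [r.description for r in rules if r.strict and not r.holds(plan)]


def plan_with_verification(
    propose: Callable[[List[str]], Plan],
    rules: List[Rule],
    max_rounds: int = 5,
) -> Tuple[Plan, List[str]]:
    """Ask the planner for a plan, check it, and feed violations back."""
    feedback: List[str] = []
    plan: Plan = []
    for _ in range(max_rounds):
        plan = propose(feedback)
        feedback = check(plan, rules)
        if not feedback:  # every strict rule holds
            break
    return plan, feedback


def no_overlap(plan: Plan) -> bool:
    ordered = sorted(plan, key=lambda s: s.start)
    return all(a.end <= b.start for a, b in zip(ordered, ordered[1:]))


def stub_planner(feedback: List[str]) -> Plan:
    # Stand-in for an LLM call: the first attempt overlaps, the retry does not.
    if not feedback:
        return [Step("gym", 8, 10), Step("meeting", 9, 10)]
    return [Step("gym", 7, 9), Step("meeting", 9, 10)]


RULES = [
    Rule("steps must not overlap", no_overlap),
    Rule("finish before 18:00", lambda p: all(s.end <= 18 for s in p), strict=False),
]

final_plan, remaining = plan_with_verification(stub_planner, RULES)
```

In this sketch, rules marked `strict=False` are skipped by the checker, which loosely mirrors the idea of letting users relax a constraint; the checker's output is deterministic and is passed back to the planner as feedback for the next round.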
