LangSAT: A Novel Framework Combining NLP and Reinforcement Learning for SAT Solving

Muyu Pan; Matthew Walter; Dheeraj Kodakandla; Mahfuza Farooque

LangSAT: A Novel Framework Combining NLP and Reinforcement Learning for SAT Solving

Muyu Pan, Matthew Walter, Dheeraj Kodakandla, Mahfuza Farooque

TL;DR

LangSAT addresses the accessibility gap in SAT solving by enabling natural language inputs to be converted into CNF and solved with an RL-enhanced CDCL solver. It introduces Lang2Logic for NL-to-CNF and SmartSAT for PPO-based heuristic optimization, and demonstrates comparable performance to traditional CDCL on the uf20-91 dataset with improved consistency. The approach enables more user-friendly reasoning, formal verification, and debugging workflows. Future work includes expanding input modalities (e.g., code) and enriching global features to further boost solving efficiency.

Abstract

Our work presents a novel reinforcement learning (RL) based framework to optimize heuristic selection within the conflict-driven clause learning (CDCL) process, improving the efficiency of Boolean satisfiability (SAT) solving. The proposed system, LangSAT, bridges the gap between natural language inputs and propositional logic by converting English descriptions into Conjunctive Normal Form (CNF) expressions and solving them using an RL-enhanced CDCL SAT solver. Unlike existing SAT-solving platforms that require CNF as input, LangSAT enables users to input standard English descriptions, making SAT-solving more accessible. The framework comprises two key components: Lang2Logic, which translates English sentences into CNF expressions, and SmartSAT, an RL-based SAT solver. SmartSAT encodes clause-variable relationships as structured graph representations and extracts global features specific to the SAT problem. This implementation provides the RL agent with deeper contextual information, enabling SAT problems to be solved more efficiently. Lang2Logic was evaluated on diverse natural language inputs, processing descriptions up to 450 words. The generated CNFs were solved by SmartSAT, which demonstrated comparable performance to traditional CDCL heuristics with respect to solving time. The combined LangSAT framework offers a more accessible and scalable solution for SAT-solving tasks across reasoning, formal verification, and debugging.

LangSAT: A Novel Framework Combining NLP and Reinforcement Learning for SAT Solving

TL;DR

Abstract

LangSAT: A Novel Framework Combining NLP and Reinforcement Learning for SAT Solving

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)