Towards Enforcing Company Policy Adherence in Agentic Workflows
Naama Zwerdling, David Boaz, Ella Rabinovich, Guy Uziel, David Amid, Ateret Anaby-Tavor
TL;DR
The paper tackles the problem of reliably enforcing company policies in agentic workflows powered by large language models. It introduces a deterministic, two-phase framework consisting of an offline buildtime policy-to-tool mapping (Tool-Policy Mapper) and a runtime guard-generation component (ToolGuards) that execute before each tool invocation. Through evaluation in the $\tau$-bench Airlines domain, the approach yields encouraging improvements in end-to-end policy adherence, demonstrating substantial gains over baseline, best-effort methods and offering a scalable path toward enterprise-grade deployment. The work provides a detailed lifecycle, evaluation methods, and practical guidance for integrating policy-aware enforcement into existing agentic frameworks, highlighting both potential benefits and real-world challenges that remain to be addressed.
Abstract
Large Language Model (LLM) agents hold promise for a flexible and scalable alternative to traditional business process automation, but struggle to reliably follow complex company policies. In this study we introduce a deterministic, transparent, and modular framework for enforcing business policy adherence in agentic workflows. Our method operates in two phases: (1) an offline buildtime stage that compiles policy documents into verifiable guard code associated with tool use, and (2) a runtime integration where these guards ensure compliance before each agent action. We demonstrate our approach on the challenging $τ$-bench Airlines domain, showing encouraging preliminary results in policy enforcement, and further outline key challenges for real-world deployments.
