Opus: A Quantitative Framework for Workflow Evaluation
Alan Seroul, Théo Fagnoni, Inès Adnani, Dana O. Mohamed, Phillip Kingston
TL;DR
The paper introduces the Opus Workflow Evaluation Framework, a probabilistic-normative method to quantify AI-driven Workflows by combining a stochastic Reward with normative Penalties that capture structural and signal-related quality. Workflows are modeled as DAGs of Tasks with probabilistic success, resource use, and gains, and penalties (Ch, Cp, Ob, Ih) are aggregated into CIP and SIP to guide optimization via a two-stage process that first maximizes Reward then minimizes the overall Penalty. A concrete case study on customer-support ticket automation demonstrates how the framework ranks Candidate Workflows and highlights how topology, observability, and signal quality influence selection beyond raw gain. The approach is positioned for integration with Reinforcement Learning, enabling autonomous workflow discovery and continuous improvement in live automation systems. Overall, Opus provides a principled, scalable foundation for objective comparison, optimization, and self-improvement of complex AI-driven workflows in uncertain environments.
Abstract
This paper introduces the Opus Workflow Evaluation Framework, a probabilistic-normative formulation for quantifying Workflow quality and efficiency. It integrates notions of correctness, reliability, and cost into a coherent mathematical model that enables direct comparison, scoring, and optimization of Workflows. The framework combines the Opus Workflow Reward, a probabilistic function estimating expected performance through success likelihood, resource usage, and output gain, with the Opus Workflow Normative Penalties, a set of measurable functions capturing structural and informational quality across Cohesion, Coupling, Observability, and Information Hygiene. It supports automated Workflow assessment, ranking, and optimization within modern automation systems such as Opus and can be integrated into Reinforcement Learning loops to guide Workflow discovery and refinement. In this paper, we introduce the Opus Workflow Reward model that formalizes Workflow success as a probabilistic expectation over costs and outcomes. We define measurable Opus Workflow Normative Penalties capturing structural, semantic, and signal-related properties of Workflows. Finally, we propose a unified optimization formulation for identifying and ranking optimal Workflows under joint Reward-Penalty trade-offs.
