Authenticated Workflows: A Systems Approach to Protecting Agentic AI
Mohan Rajagopalan, Vinay Rao
TL;DR
The paper addresses the security gaps in enterprise agentic AI by introducing authenticated workflows as a complete trust layer that guards four fundamental boundaries: prompts, tools, data, and context. It combines cryptographic integrity with runtime policy enforcement, delivering deterministic security through independent policy enforcement points and attestations of workflow progress. MAPL provides an AI-native policy language enabling scalable, hierarchical, and attestable policies that reduce policy specification from $O(M \times N)$ to $O(\log M + N)$. A universal security runtime integrates nine heterogeneous frameworks via thin adapters, supported by formal proofs of completeness and soundness and empirical validation showing 100% recall, 0% false positives across 174 test cases, and protection against notable production CVEs. The approach promises practical deployment by bridging enterprise IAM, ensuring boundary-level integrity, and enabling provable, auditable, and scalable agentic AI in production.
Abstract
Agentic AI systems automate enterprise workflows but existing defenses--guardrails, semantic filters--are probabilistic and routinely bypassed. We introduce authenticated workflows, the first complete trust layer for enterprise agentic AI. Security reduces to protecting four fundamental boundaries: prompts, tools, data, and context. We enforce intent (operations satisfy organizational policies) and integrity (operations are cryptographically authentic) at every boundary crossing, combining cryptographic elimination of attack classes with runtime policy enforcement. This delivers deterministic security--operations either carry valid cryptographic proof or are rejected. We introduce MAPL, an AI-native policy language that expresses agentic constraints dynamically as agents evolve and invocation context changes, scaling as O(log M + N) policies versus O(M x N) rules through hierarchical composition with cryptographic attestations for workflow dependencies. We prove practicality through a universal security runtime integrating nine leading frameworks (MCP, A2A, OpenAI, Claude, LangChain, CrewAI, AutoGen, LlamaIndex, Haystack) through thin adapters requiring zero protocol modifications. Formal proofs establish completeness and soundness. Empirical validation shows 100% recall with zero false positives across 174 test cases, protection against 9 of 10 OWASP Top 10 risks, and complete mitigation of two high impact production CVEs.
