Synthesizing Access Control Policies using Large Language Models
Adarsh Vatsa, Pratyush Patel, William Eiers
TL;DR
This work tackles the problem of synthesizing cloud access control policies from specification inputs using large language models in a zero-shot setting. It analyzes three prompt styles—concrete requests, coarse natural-language descriptions, and fine-grained syntax—to generate AWS IAM-like policies and verifies their correctness via SMT-based reasoning and permissiveness comparisons. Results show that concrete prompts yield strong but not universal accuracy (e.g., ~83.82% on 100 cases), while coarse prompts often lead to incomparable outputs due to description imprecision; fine-grained syntax prompts improve precision in the majority of cases, highlighting the value of structured specifications. The study suggests that combining structured policy syntax with LLMs could enable reliable policy generation and points to future work on transfer learning, larger datasets, and hybrid human-readable formal specifications for robust guarantees.
Abstract
Cloud compute systems allow administrators to write access control policies that govern access to private data. While policies are written in convenient languages, such as AWS Identity and Access Management Policy Language, manually written policies often become complex and error prone. In this paper, we investigate whether and how well Large Language Models (LLMs) can be used to synthesize access control policies. Our investigation focuses on the task of taking an access control request specification and zero-shot prompting LLMs to synthesize a well-formed access control policy which correctly adheres to the request specification. We consider two scenarios, one which the request specification is given as a concrete list of requests to be allowed or denied, and another in which a natural language description is used to specify sets of requests to be allowed or denied. We then argue that for zero-shot prompting, more precise and structured prompts using a syntax based approach are necessary and experimentally show preliminary results validating our approach.
