Permissioned LLMs: Enforcing Access Control in Large Language Models
Bargav Jayaraman, Virendra J. Marathe, Hamid Mozaffari, William F. Shen, Krishnaram Kenthapadi
TL;DR
This work formulates Permissioned LLMs (PermLLMs) to enforce enterprise data access controls within fine-tuned language models by mapping data-domain constraints to parameter subsets via PEFT adapters. It introduces a formalism around relevant responses and a new auditing metric, access advantage, with two instantiations—Domain Distinguishability Index (DDI) and Utility Gap Index (UGI)—to evaluate enforcement. Three PEFT-based mechanisms are proposed: Activate (one LoRA per domain), Merge (merge adapters for domain groups), and Union (train adapters on domain unions), with formal guarantees and auditing frameworks. Extensive experiments on five public datasets using Llama-3.1-8B and Mistral-0.1-7B demonstrate that Union yields the strongest domain separation (high DDI) and robust access control, while Evaluate metrics capture utility trade-offs across multi-domain scenarios. The findings highlight the viability of formal, auditable access control in PermLLMs for secure enterprise deployments, while noting scalability and hierarchy limitations as avenues for future work.
Abstract
In enterprise settings, organizational data is segregated, siloed and carefully protected by elaborate access control frameworks. These access control structures can completely break down if an LLM fine-tuned on the siloed data serves requests, for downstream tasks, from individuals with disparate access privileges. We propose Permissioned LLMs (PermLLM), a new class of LLMs that superimpose the organizational data access control structures on query responses they generate. We formalize abstractions underpinning the means to determine whether access control enforcement happens correctly over LLM query responses. Our formalism introduces the notion of a relevant response that can be used to prove whether a PermLLM mechanism has been implemented correctly. We also introduce a novel metric, called access advantage, to empirically evaluate the efficacy of a PermLLM mechanism. We introduce three novel PermLLM mechanisms that build on Parameter Efficient Fine-Tuning to achieve the desired access control. We furthermore present two instantiations of access advantage--(i) Domain Distinguishability Index (DDI) based on Membership Inference Attacks, and (ii) Utility Gap Index (UGI) based on LLM utility evaluation. We demonstrate the efficacy of our PermLLM mechanisms through extensive experiments on five public datasets (GPQA, RCV1, SimpleQA, WMDP, and PubMedQA), in addition to evaluating the validity of DDI and UGI metrics themselves for quantifying access control in LLMs.
