Encrypted Prompt: Securing LLM Applications Against Unauthorized Actions

Shih-Han Chan

Encrypted Prompt: Securing LLM Applications Against Unauthorized Actions

Shih-Han Chan

TL;DR

This paper addresses the risk of prompt injection and API misuse in LLM-enabled agents by introducing an Encrypted Prompt that attaches to each user prompt. The Encrypted Prompt comprises a delimiter, a dynamic permission set, and a public key, enabling server-side verification to ensure only actions within the permitted scope are executed. It provides a flexible, application-level defense that can adapt permissions based on user, device, and server status, and can integrate with other safety approaches. While offering practical protection without retraining, it also notes limitations such as the need for on-device permission management and potential gaps for inherently authorized harmful actions within the permission model.

Abstract

Security threats like prompt injection attacks pose significant risks to applications that integrate Large Language Models (LLMs), potentially leading to unauthorized actions such as API misuse. Unlike previous approaches that aim to detect these attacks on a best-effort basis, this paper introduces a novel method that appends an Encrypted Prompt to each user prompt, embedding current permissions. These permissions are verified before executing any actions (such as API calls) generated by the LLM. If the permissions are insufficient, the LLM's actions will not be executed, ensuring safety. This approach guarantees that only actions within the scope of the current permissions from the LLM can proceed. In scenarios where adversarial prompts are introduced to mislead the LLM, this method ensures that any unauthorized actions from LLM wouldn't be executed by verifying permissions in Encrypted Prompt. Thus, threats like prompt injection attacks that trigger LLM to generate harmful actions can be effectively mitigated.

Encrypted Prompt: Securing LLM Applications Against Unauthorized Actions

TL;DR

Abstract

Encrypted Prompt: Securing LLM Applications Against Unauthorized Actions

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)