Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems

Sarbartha Banerjee; Prateek Sahu; Anjo Vahldiek-Oberwagner; Jose Sanchez Vicarte; Mohit Tiwari

Abstract

Rapid progress in generative AI has given rise to Compound AI systems: pipelines composed of multiple large language models (LLMs), software tools, and database systems. Compound AI systems are built on a layered traditional software stack running on distributed hardware infrastructure. Many of the diverse software components are vulnerable to traditional security flaws documented in the Common Vulnerabilities and Exposures (CVE) database, while the underlying distributed hardware infrastructure remains exposed to timing attacks, bit-flip faults, and power-based side channels. Current research targets LLM-specific risks such as model extraction, training data leakage, and unsafe generation, overlooking the impact of traditional system vulnerabilities. This work investigates how traditional software and hardware vulnerabilities can complement LLM-specific algorithmic attacks to compromise the integrity of a compound AI pipeline. We demonstrate two novel attacks that combine system-level vulnerabilities with algorithmic weaknesses: (1) exploiting a software code injection flaw along with a guardrail Rowhammer attack to inject an unaltered jailbreak prompt into an LLM, resulting in an AI safety violation, and (2) manipulating a knowledge database to redirect an LLM agent to transmit sensitive user data to a malicious application, thus breaching confidentiality. These attacks highlight the need to address traditional vulnerabilities; we systematize the attack primitives and analyze their composition by grouping vulnerabilities by their objective and mapping them to distinct stages of an attack lifecycle. This approach enables rigorous red-teaming exercises and lays the groundwork for future defense strategies.
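The systematization described in the abstract, grouping attack primitives by objective and mapping them to lifecycle stages, can be pictured as a small catalog data structure. The following is a minimal sketch only, not the paper's implementation: the stage names (ACCESS, BYPASS, IMPACT), the Gadget fields, the compose_chains helper, and the assignment of each gadget to a stage are all hypothetical, chosen for illustration. The gadget labels are abstract tags taken from the abstract's wording and carry no exploit logic.

```python
from dataclasses import dataclass
from enum import Enum
from itertools import product


class Layer(Enum):
    """Stack layer a gadget belongs to (algorithmic, software, hardware)."""
    ALGORITHM = "algorithm"
    SOFTWARE = "software"
    HARDWARE = "hardware"


class Stage(Enum):
    """Hypothetical attack-lifecycle stages, listed in execution order."""
    ACCESS = 1
    BYPASS = 2
    IMPACT = 3


@dataclass(frozen=True)
class Gadget:
    name: str        # descriptive label only
    layer: Layer
    stage: Stage     # assumed stage assignment, for illustration
    objective: str   # security property the gadget helps violate


def compose_chains(gadgets, objective):
    """Enumerate chains holding one matching gadget per stage, in stage order."""
    by_stage = {
        stage: [g for g in gadgets if g.stage is stage and g.objective == objective]
        for stage in Stage
    }
    # A chain is only complete if every lifecycle stage has a candidate.
    if any(not candidates for candidates in by_stage.values()):
        return []
    return [list(chain) for chain in product(*(by_stage[s] for s in Stage))]


CATALOG = [
    Gadget("software code injection", Layer.SOFTWARE, Stage.ACCESS, "safety"),
    Gadget("guardrail Rowhammer fault", Layer.HARDWARE, Stage.BYPASS, "safety"),
    Gadget("jailbreak prompt", Layer.ALGORITHM, Stage.IMPACT, "safety"),
    Gadget("knowledge database manipulation", Layer.SOFTWARE, Stage.ACCESS,
           "confidentiality"),
]

safety_chains = compose_chains(CATALOG, "safety")
chain_names = [[g.name for g in chain] for chain in safety_chains]
```

Under these assumptions, compose_chains(CATALOG, "safety") yields a single three-gadget chain that spans the software, hardware, and algorithm layers, mirroring the shape of the first attack in the abstract. A red-teaming exercise could use such a catalog to check which objectives already have a complete chain and which stages lack a known gadget.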

Paper Structure (22 sections, 8 figures, 1 table)

Introduction
Security of Compound AI Systems
Components in Compound AI Pipeline
Cross-stack attack gadgets
Attack Gadget Systematization
Security properties
Classification of attacker capability
Classification of cross-stack attack vectors
Attack Gadget Composition
Motivation for Attack Gadget Composition
Composition of cross-stack attack gadgets
Formation of concrete attack chains
Case Study: Attack Gadget Composition to violate AI Safety
Attacker threat model
Step 1: Subverting the query paraphrasing
...and 7 more sections

Figures (8)

Figure 1: The building blocks of a Compound AI pipeline with cross-stack attack gadgets comprising adversarial attacks, software vulnerabilities, and hardware side-channels.
Figure 2: The building blocks of a Compound AI pipeline with cross-stack attack gadgets comprising adversarial attacks, software vulnerabilities, and hardware side-channels.
Figure 3: The building blocks of a Compound AI pipeline with cross-stack attack gadgets comprising adversarial attacks, software vulnerabilities, and hardware side-channels.
Figure 4: Cascade framework: Given an attacker’s objective, capability, and query, the framework uses LLM-based reasoning to retrieve candidate gadgets, evaluate them against the target AI pipeline, and iteratively refine attack chains -- instantiating open-source testbeds when needed (e.g., for GCG jailbreaks) -- until success or timeout.
Figure 5: Efficacy of guardrail and language model against harmful prompts. Guardrails are trained to filter unsafe queries but do not modify the queries themselves. Guardrails perform better for certain categories and, overall, block 63% of the queries that generative models fail to stop.
...and 3 more figures
