SecureRAG-RTL: A Retrieval-Augmented, Multi-Agent, Zero-Shot LLM-Driven Framework for Hardware Vulnerability Detection

Touseef Hasan; Blessing Airehenbuwa; Nitin Pundir; Souvika Sarkar; Ujjwal Guin

TL;DR

This work proposes SecureRAG-RTL, a Retrieval-Augmented Generation (RAG)-based approach to LLM-based security verification of hardware designs. It integrates domain-specific retrieval with generative reasoning so that models can compensate for their limited hardware security expertise, and it improves vulnerability detection accuracy by about 30% on average over prompt-only baselines.

Abstract

Large language models (LLMs) have shown remarkable capabilities in natural language processing tasks, yet their application in hardware security verification remains limited due to the scarcity of publicly available hardware description language (HDL) datasets. This knowledge gap constrains LLM performance in detecting vulnerabilities within HDL designs. To address this challenge, we propose SecureRAG-RTL, a novel Retrieval-Augmented Generation (RAG)-based approach that significantly enhances LLM-based security verification of hardware designs. Our approach integrates domain-specific retrieval with generative reasoning, enabling models to overcome inherent limitations in hardware security expertise. We establish baseline vulnerability detection rates using prompt-only methods and then demonstrate that SecureRAG-RTL achieves substantial improvements across diverse LLM architectures, regardless of model size. On average, our method increases detection accuracy by about 30%, highlighting its effectiveness in bridging domain knowledge gaps. For evaluation, we curated and annotated a benchmark dataset of 14 HDL designs containing real-world security vulnerabilities, which we will release publicly to support future research. These findings underscore the potential of RAG-driven augmentation to enable scalable, efficient, and accurate hardware security verification workflows.
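To make the idea of pairing domain-specific retrieval with generative reasoning concrete, the following is a minimal sketch and not the authors' implementation. It assumes a tiny hand-written CWE knowledge base (the descriptions are paraphrased for illustration), a plain token-overlap score standing in for the paper's hybrid retrieval, and hypothetical helper names (`retrieve`, `build_prompt`). The LLM call itself is left out; the sketch only shows how retrieved CWE entries could be placed into a detection prompt.

```python
import re
from collections import Counter

# Hypothetical mini knowledge base. The CWE IDs are real hardware CWEs;
# the descriptions are short paraphrases written for this illustration.
CWE_DB = {
    "CWE-1191": "on-chip debug and test interface with improper access control jtag",
    "CWE-1234": "hardware internal or debug modes allow override of locks register lock bit",
    "CWE-1245": "improper finite state machines fsm in hardware logic undefined state",
}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def score(query, doc):
    """Token-overlap score: a simple lexical stand-in for hybrid retrieval."""
    q, d = Counter(tokenize(query)), Counter(tokenize(doc))
    return sum(min(q[t], d[t]) for t in q)

def retrieve(summary, k=2):
    """Return the k CWE IDs whose descriptions best match the RTL summary."""
    ranked = sorted(CWE_DB, key=lambda cid: score(summary, CWE_DB[cid]), reverse=True)
    return ranked[:k]

def build_prompt(rtl_code, summary, k=2):
    """Assemble a zero-shot detection prompt augmented with retrieved CWE entries."""
    context = "\n".join(f"- {cid}: {CWE_DB[cid]}" for cid in retrieve(summary, k))
    return (
        "You are a hardware security reviewer.\n"
        f"Candidate weaknesses:\n{context}\n\n"
        f"RTL under review:\n{rtl_code}\n\n"
        "State whether any candidate weakness is present and explain why."
    )

summary = "A register lock bit can be overridden when debug mode is enabled"
top = retrieve(summary, k=2)
prompt = build_prompt("module lock_reg(...); endmodule", summary)
```

In this sketch the summary is written by hand; in a full pipeline it would come from a separate summarization step over the RTL, and the prompt would be sent to whichever LLM is being evaluated.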

Paper Structure (17 sections, 1 equation, 4 figures, 4 tables, 1 algorithm)

Introduction
Prior Work
LLMs for HDL Vulnerability Detection
Hardware Vulnerability Database
Retrieval-Augmented Generation
The SecureRAG-RTL Framework
Retrieval Phase
RTL Summary and Signature Extraction
CWE Knowledge Database
Hybrid Retrieval and Evaluation
Detection Phase
Experimental Setup
Dataset Construction
Models Evaluated
Evaluation Metric
...and 2 more sections

Figures (4)

Figure 1: The proposed SecureRAG-RTL pipeline for hardware vulnerability detection.
Figure 2: Detection Agent Prompt.
Figure 3: Detection agent responses when CWE is not found.
Figure 4: Detection agent responses when a CWE is found.
