HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications

Rishi Kalra; Zekun Wu; Ayesha Gulley; Airlie Hilliard; Xin Guan; Adriano Koshiyama; Philip Treleaven

HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications

Rishi Kalra, Zekun Wu, Ayesha Gulley, Airlie Hilliard, Xin Guan, Adriano Koshiyama, Philip Treleaven

TL;DR

HyPA-RAG tackles the limitations of LLMs in AI legal and policy tasks by introducing a query-complexity classifier for adaptive parameter tuning, and a hybrid retrieval stack that fuses dense, sparse, and knowledge-graph sources. Tested on NYC Local Law 144, the approach demonstrates improvements in retrieval fidelity, contextual precision, and overall correctness through a rigorous evaluation framework and targeted metrics. The work contributes a pragmatic, domain-adaptive RAG architecture, an MG-backed knowledge integration strategy, and comprehensive ablations highlighting the value of adaptivity in high-stakes legal contexts. These findings support scalable, accountable AI tooling for policy and legal decision-support while outlining ethical considerations and future directions for domain-aware RAG systems.

Abstract

Large Language Models (LLMs) face limitations in AI legal and policy applications due to outdated knowledge, hallucinations, and poor reasoning in complex contexts. Retrieval-Augmented Generation (RAG) systems address these issues by incorporating external knowledge, but suffer from retrieval errors, ineffective context integration, and high operational costs. This paper presents the Hybrid Parameter-Adaptive RAG (HyPA-RAG) system, designed for the AI legal domain, with NYC Local Law 144 (LL144) as the test case. HyPA-RAG integrates a query complexity classifier for adaptive parameter tuning, a hybrid retrieval approach combining dense, sparse, and knowledge graph methods, and a comprehensive evaluation framework with tailored question types and metrics. Testing on LL144 demonstrates that HyPA-RAG enhances retrieval accuracy, response fidelity, and contextual precision, offering a robust and adaptable solution for high-stakes legal and policy applications.

HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications

TL;DR

Abstract

Paper Structure (42 sections, 8 figures, 9 tables)

This paper contains 42 sections, 8 figures, 9 tables.

Introduction
Background and Related Work
System Design
AI Legal and Policy Corpus
Performance Evaluation
Dataset Generation
Evaluation Metrics
Correctness Evaluation
Chunking Method
Query Complexity Classifier
Training Data
Model Training
Classifier Results
RAG System Architecture
Parameter-Adaptive RAG (PA-RAG)
...and 27 more sections

Figures (8)

Figure 1: Hybrid Parameter Adaptive RAG (HyPA-RAG) System Diagram
Figure 2: Spearman Coefficient Comparison, showing the correlation between model performance and human evaluation.
Figure 3: RAG Evaluation Metrics for Sentence-Level, Semantic, and Pattern-Based Chunking Methods
Figure 4: Demo screenshots showing each key stage of the user experience.
Figure 5: Overall RAG Development Workflow Diagram
...and 3 more figures

HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications

TL;DR

Abstract

HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications

Authors

TL;DR

Abstract

Table of Contents

Figures (8)