Retrieval-Augmented Framework for LLM-Based Clinical Decision Support

Leon Garza; Anantaa Kotal; Michael A. Grasso; Emre Umucu

Retrieval-Augmented Framework for LLM-Based Clinical Decision Support

Leon Garza, Anantaa Kotal, Michael A. Grasso, Emre Umucu

TL;DR

The paper tackles the challenge of safe, context-aware prescribing in the face of heterogeneous EHR data by proposing a retrieval-augmented generation framework that grounds LLM outputs in semantically similar historical cases. It formalizes the problem as a retrieval-augmented conditional generation task, introduces a modular five-component architecture for fusing structured and unstructured EHR signals, and implements a retrieval and generation pipeline that pairs patient context with precedent cases to produce actionable treatment recommendations. The evaluation on a real ED dataset demonstrates competitive predictive performance relative to traditional baselines, with strong clinical consistency and interpretability due to precedent grounding. The work highlights practical considerations for deployment, including transparency, safety, and alignment with clinical workflows, and argues for the viability of LLM-assisted prescribing under appropriate validation and human-in-the-loop oversight.

Abstract

The increasing complexity of clinical decision-making, alongside the rapid expansion of electronic health records (EHR), presents both opportunities and challenges for delivering data-informed care. This paper proposes a clinical decision support system powered by Large Language Models (LLMs) to assist prescribing clinicians. The system generates therapeutic suggestions by analyzing historical EHR data, including patient demographics, presenting complaints, clinical symptoms, diagnostic information, and treatment histories. The framework integrates natural language processing with structured clinical inputs to produce contextually relevant recommendations. Rather than replacing clinician judgment, it is designed to augment decision-making by retrieving and synthesizing precedent cases with comparable characteristics, drawing on local datasets or federated sources where applicable. At its core, the system employs a retrieval-augmented generation (RAG) pipeline that harmonizes unstructured narratives and codified data to support LLM-based inference. We outline the system's technical components, including representation representation alignment and generation strategies. Preliminary evaluations, conducted with de-identified and synthetic clinical datasets, examine the clinical plausibility and consistency of the model's outputs. Early findings suggest that LLM-based tools may provide valuable decision support in prescribing workflows when appropriately constrained and rigorously validated. This work represents an initial step toward integration of generative AI into real-world clinical decision-making with an emphasis on transparency, safety, and alignment with established practices.

Retrieval-Augmented Framework for LLM-Based Clinical Decision Support

TL;DR

Abstract

Retrieval-Augmented Framework for LLM-Based Clinical Decision Support

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (1)