Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation
Thomas Cook, Richard Osuagwu, Liman Tsatiashvili, Vrynsia Vrynsia, Koustav Ghosal, Maraim Masoud, Riccardo Mattivi
TL;DR
The paper tackles the challenge of applying retrieval-augmented generation in fintech, where domain-specific ontologies and acronym-heavy content hinder standard RAG pipelines. It introduces an agentic RAG (A-RAG) with an Orchestrator that coordinates specialized agents for acronym resolution, sub-query generation, parallel retrieval, cross-encoder re-ranking, and QA-driven refinement, to enable iterative and domain-aware retrieval. Evaluated against a baseline RAG (B-RAG) on an enterprise fintech knowledge base, A-RAG achieves higher retrieval accuracy (62.35% vs 54.12%), and a broader notion of correctness when semantically equivalent sources are considered (69.41% vs 58.82%), albeit with higher latency (5.02s vs 0.79s). The study demonstrates that structured, multi-agent pipelines enhance retrieval robustness in complex, domain-specific environments, while highlighting trade-offs and avenues for future improvements such as adaptive agent coordination and stronger context-awareness.
Abstract
Retrieval-Augmented Generation (RAG) systems often face limitations in specialized domains such as fintech, where domain-specific ontologies, dense terminology, and acronyms complicate effective retrieval and synthesis. This paper introduces an agentic RAG architecture designed to address these challenges through a modular pipeline of specialized agents. The proposed system supports intelligent query reformulation, iterative sub-query decomposition guided by keyphrase extraction, contextual acronym resolution, and cross-encoder-based context re-ranking. We evaluate our approach against a standard RAG baseline using a curated dataset of 85 question--answer--reference triples derived from an enterprise fintech knowledge base. Experimental results demonstrate that the agentic RAG system outperforms the baseline in retrieval precision and relevance, albeit with increased latency. These findings suggest that structured, multi-agent methodologies offer a promising direction for enhancing retrieval robustness in complex, domain-specific settings.
