FinVet: A Collaborative Framework of RAG and External Fact-Checking Agents for Financial Misinformation Detection
Daniel Berhane Araya, Duoduo Liao
TL;DR
This paper tackles the challenge of financial misinformation by introducing FinVet, a multi-agent framework that fuses two Retrieval-Augmented Generation (RAG) pipelines with an external fact-checking module, all coordinated via a confidence-weighted voting mechanism. An adaptive three-tier evidence processing strategy uses retrieval similarity to switch among direct metadata extraction, hybrid reasoning, and model-based analysis, delivering transparent verdicts with source attribution and quantified uncertainty. The framework is validated on the FinFact dataset, achieving an F1 score of 0.85, a substantial improvement over both single pipelines and standalone RAG approaches, and demonstrating the benefits of collaborative verification for explainability in finance. These results suggest FinVet’s approach provides robust, auditable detection of financial misinformation with practical implications for institutions and regulators managing market integrity and investor protection.
Abstract
Financial markets face growing threats from misinformation that can trigger billions in losses in minutes. Most existing approaches lack transparency in their decision-making and provide limited attribution to credible sources. We introduce FinVet, a novel multi-agent framework that integrates two Retrieval-Augmented Generation (RAG) pipelines with external fact-checking through a confidence-weighted voting mechanism. FinVet employs adaptive three-tier processing that dynamically adjusts verification strategies based on retrieval confidence, from direct metadata extraction to hybrid reasoning to full model-based analysis. Unlike existing methods, FinVet provides evidence-backed verdicts, source attribution, confidence scores, and explicit uncertainty flags when evidence is insufficient. Experimental evaluation on the FinFact dataset shows that FinVet achieves an F1 score of 0.85, which is a 10.4% improvement over the best individual pipeline (fact-check pipeline) and 37% improvement over standalone RAG approaches.
