ScienceStack
Explore
Plans and pricing
FAQ
Contact us
Table of Contents
Fetching ...
Sign In
Toggle theme
Paper
Download
MRAG-Suite: A Diagnostic Evaluation Platform for Visual Retrieval-Augmented Generation
Published: September 29, 2025
arXiv: 2509.24253v2
Authors
Yuelyu Ji
Abstract
Multimodal Retrieval-Augmented Generation (Visual RAG) significantly advances question answering by integrating visual and textual evidence. Yet, current evaluations fail to systematically account for query difficulty and ambiguity. We propose MRAG-Suite, a diagnostic evaluation platform integrating diverse multimodal benchmarks (WebQA, Chart-RAG, Visual-RAG, MRAG-Bench). We introduce difficulty-based and ambiguity-aware filtering strategies, alongside MM-RAGChecker, a claim-level diagnostic tool. Our results demonstrate substantial accuracy reductions under difficult and ambiguous queries, highlighting prevalent hallucinations. MM-RAGChecker effectively diagnoses these issues, guiding future improvements in Visual RAG systems.