Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

Shivani Upadhyay; Nandan Thakur; Ronak Pradeep; Nick Craswell; Daniel Campos; Jimmy Lin

Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

Shivani Upadhyay, Nandan Thakur, Ronak Pradeep, Nick Craswell, Daniel Campos, Jimmy Lin

TL;DR

This year's TREC 2025 RAG Track introduces long, multi-sentence narrative queries to better reflect the deep search task with the growing demand for reasoning-driven responses.

Abstract

The second edition of the TREC Retrieval Augmented Generation (RAG) Track advances research on systems that integrate retrieval and generation to address complex, real-world information needs. Building on the foundation of the inaugural 2024 track, this year's challenge introduces long, multi-sentence narrative queries to better reflect the deep search task with the growing demand for reasoning-driven responses. Participants are tasked with designing pipelines that combine retrieval and generation while ensuring transparency and factual grounding. The track leverages the MS MARCO V2.1 corpus and employs a multi-layered evaluation framework encompassing relevance assessment, response completeness, attribution verification, and agreement analysis. By emphasizing multi-faceted narratives and attribution-rich answers from over 150 submissions this year, the TREC 2025 RAG Track aims to foster innovation in creating trustworthy, context-aware systems for retrieval augmented generation.

Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

TL;DR

This year's TREC 2025 RAG Track introduces long, multi-sentence narrative queries to better reflect the deep search task with the growing demand for reasoning-driven responses.

Abstract

Paper Structure (23 sections, 2 equations, 8 figures, 13 tables)

This paper contains 23 sections, 2 equations, 8 figures, 13 tables.

Introduction
Task Setup
Reference Ranked Lists.
Narrative Generation.
Sub-Narratives Generation.
Evaluation
Relevance Assessment
Automatic Assessment.
Response Evaluation
AutoNuggets + AutoAssign.
Post-Edit Nuggets + AutoAssign.
Post-Edit Nuggets + ManualAssign.
Support Evaluation
Weighted Precision.
Weighted Recall.
...and 8 more sections

Figures (8)

Figure 1: Overview of the narrative generation process in the TREC 2025 RAG Track.
Figure 2: Prompt used in the first step of the sub-narrative generation process with GPT-4.1 nano to generate an exhaustive list of sub-queries using the given narrative and query cluster as input.
Figure 3: Prompt used in the second step of refining the sub-narratives with GPT-4.1 to generate an exhaustive list of sub-queries using the given narrative and the sub-queries generated in the previous step as input.
Figure 4: Example of narrative and sub-narrative pair used in TREC 2025 RAG.
Figure 5: Prompt for assigning relevance labels, where the inputs are the sub-narrative list, narrative and passage pair.
...and 3 more figures

Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

TL;DR

Abstract

Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

Authors

TL;DR

Abstract

Table of Contents

Figures (8)