Aligning Extraction and Generation for Robust Retrieval-Augmented Generation
Hwanjun Song, Jeonghwan Choi, Minseok Kim
TL;DR
Ext2Gen tackles robust retrieval-augmented generation by training LLMs to extract query-relevant evidence from noisy retrievals before generating answers, thereby mitigating the effects of chunk misplacement and information noisiness. The method jointly optimizes extraction and generation through preference-aligned, pairwise feedback derived from multiple LLMs and QA metrics, removing the need for pre-generation compression. Empirical results show Ext2Gen substantially improves robustness over compression-based baselines and benefits further from improved retrieval techniques, with good generalization to other backbones and deployment in real RAG pipelines. The work demonstrates that generation-side enhancements provide complementary gains beyond retrieval improvements in practical RAG systems.
Abstract
Retrieval-augmented generation (RAG) enhances LLMs with external knowledge, yet generation remains vulnerable to retrieval-induced noise and uncertain placement of relevant chunks, often causing hallucinations. We present Ext2Gen, an extract-then-generate framework that strengthens LLMs via joint evidence selection and answer generation, dynamically identifying query-relevant content while suppressing noise, thereby removing the need for any independent pre-generation compression module. Optimized through preference alignment with well-curated pairwise feedback, Ext2Gen produces accurate and faithful answers even under noisy or imprecise retrieval. Experiments demonstrate that it substantially enhances the robustness of the generation backbone and yields greater performance gains than methods relying on independent compression models, e.g., Recomp, CompAct, EXIT). It further benefits from improved retrieval techniques such as query rewriting, underscoring that generation-side enhancements address limitations that retrieval alone cannot overcome.
