RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

Yanming Liu; Xinyue Peng; Xuhong Zhang; Weihao Liu; Jianwei Yin; Jiannan Cao; Tianyu Du

RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

Yanming Liu, Xinyue Peng, Xuhong Zhang, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du

TL;DR

RA-ISF presents a three-module iterative framework for retrieval-augmented question answering that jointly reasons over internal knowledge, retrieved passages, and decomposed sub-questions. By training lightweight sub-models for self-knowledge, relevance, and decomposition, and employing iterative refinement with a thresholded depth, RA-ISF reduces hallucinations and improves factual accuracy across diverse OpenQA benchmarks. Empirical results show large gains over retrieval baselines on GPT-3.5 and competitive gains with Llama-2, along with thorough ablations and efficiency analyses. The work highlights the practical impact of combining modular reasoning with retrieval in a controlled, iterative loop for robust knowledge-intensive QA.

Abstract

Large language models (LLMs) demonstrate exceptional performance in numerous tasks but still heavily rely on knowledge stored in their parameters. Moreover, updating this knowledge incurs high training costs. Retrieval-augmented generation (RAG) methods address this issue by integrating external knowledge. The model can answer questions it couldn't previously by retrieving knowledge relevant to the query. This approach improves performance in certain scenarios for specific tasks. However, if irrelevant texts are retrieved, it may impair model performance. In this paper, we propose Retrieval Augmented Iterative Self-Feedback (RA-ISF), a framework that iteratively decomposes tasks and processes them in three submodules to enhance the model's problem-solving capabilities. Experiments show that our method outperforms existing benchmarks, performing well on models like GPT3.5, Llama2, significantly enhancing factual reasoning capabilities and reducing hallucinations.

RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

TL;DR

Abstract

Paper Structure (32 sections, 5 equations, 4 figures, 7 tables, 1 algorithm)

This paper contains 32 sections, 5 equations, 4 figures, 7 tables, 1 algorithm.

Introduction
Related Work
Retrieval Augmented Language Model
Task Decomposition
Methodology
Overview
RA-ISF Training
RA-ISF Inference
Experimental Setup
Datasets
Models
Retriever and Corpus
Baselines
Implementation Details
Experiment Results
...and 17 more sections

Figures (4)

Figure 1: Overview of RA-ISF. It consists of three sub-modules: self-knowledge, passage relevance, and question decomposition.
Figure 2: Question accuracy on the NQ dataset with the growth of the iteration in question decomposition $D_{th}$.
Figure 3: Trend of question accuracy on the NQ dataset with the growth of the iteration in question decomposition $k$.
Figure 4: Trend of question accuracy on the NQ and TriviaQA dataset with the growth of the iteration in question decomposition $k$.

RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

TL;DR

Abstract

RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

Authors

TL;DR

Abstract

Table of Contents

Figures (4)