RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering
Bolei He, Xinran He, Mengke Chen, Xianwei Xue, Ying Zhu, Zhenhua Ling
TL;DR
RISE introduces a self-driven iterative framework that merges Retrieval-Augmented Generation with self-exploration to tackle multi-hop QA. By cycling through question decomposition, retrieve-then-read, and self-critique, it generates task-specific data and optimizes across multiple objectives to progressively enhance reasoning and evidence integration. Empirical results on MHQA benchmarks show significant improvements in reasoning accuracy and task performance, with ablations confirming the complementary value of each component. The approach also demonstrates robustness across iterations and model variants, while highlighting a cost-efficient path to improved complex reasoning without heavy external supervision.
Abstract
Large Language Models (LLMs) excel in many areas but continue to face challenges with complex reasoning tasks such as Multi-Hop Question Answering (MHQA). MHQA requires integrating evidence from diverse sources while managing intricate logical dependencies, which often leads to reasoning errors. Retrieval-Augmented Generation (RAG), widely employed in MHQA tasks, struggles to filter noisy data and to retrieve all necessary evidence, which limits its effectiveness on MHQA. To address these challenges, we propose RISE: Reasoning Enhancement via Iterative Self-Exploration, a novel framework designed to enhance models' reasoning capability through iterative self-exploration. Specifically, RISE involves three key steps in addressing MHQA tasks: question decomposition, retrieve-then-read, and self-critique. Through continuous self-exploration, RISE identifies accurate reasoning paths and iteratively improves the model's ability to integrate evidence, maintain logical consistency, and perform well on MHQA tasks. Extensive experiments on multiple MHQA benchmarks demonstrate that RISE significantly improves reasoning accuracy and task performance.
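The three steps named in the abstract can be pictured as a simple loop. The following is a minimal sketch under stated assumptions, not the authors' implementation: the names `Step`, `explore`, and `max_hops`, the four callables, and the toy plan and corpus are all hypothetical, and in practice each callable would be backed by an LLM or a retriever.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Step:
    """One recorded hop of a self-exploration pass (hypothetical structure)."""
    sub_question: str
    evidence: List[str]
    answer: str
    accepted: bool  # verdict returned by the self-critique step


def explore(
    question: str,
    decompose: Callable[[str, List[Step]], Optional[str]],
    retrieve: Callable[[str], List[str]],
    read: Callable[[str, List[str]], str],
    critique: Callable[[str, List[str], str], bool],
    max_hops: int = 4,
) -> List[Step]:
    """Run decomposition, retrieve-then-read, and self-critique for up to max_hops."""
    history: List[Step] = []
    for _ in range(max_hops):
        sub_q = decompose(question, history)      # question decomposition
        if sub_q is None:                         # nothing left to ask
            break
        evidence = retrieve(sub_q)                # retrieve ...
        answer = read(sub_q, evidence)            # ... then read
        accepted = critique(sub_q, evidence, answer)  # self-critique
        history.append(Step(sub_q, evidence, answer, accepted))
    return history


# Toy stand-ins (assumptions for illustration only; all data is fictional).
PLAN = ["Who wrote Book X?", "Where was Author Y born?"]
CORPUS = {
    "Who wrote Book X?": ["Book X was written by Author Y."],
    "Where was Author Y born?": ["Author Y was born in City Z."],
}


def toy_decompose(question: str, history: List[Step]) -> Optional[str]:
    # Advance through the fixed plan only when the previous hop was accepted.
    done = sum(step.accepted for step in history)
    return PLAN[done] if done < len(PLAN) else None


def toy_retrieve(sub_q: str) -> List[str]:
    return CORPUS.get(sub_q, [])


def toy_read(sub_q: str, evidence: List[str]) -> str:
    # Take the last two words of the first passage as the answer.
    if not evidence:
        return ""
    return " ".join(evidence[0].rstrip(".").split()[-2:])


def toy_critique(sub_q: str, evidence: List[str], answer: str) -> bool:
    # Accept an answer only if it is non-empty and supported by some passage.
    return bool(answer) and any(answer in passage for passage in evidence)


steps = explore(
    "Where was the author of Book X born?",
    toy_decompose, toy_retrieve, toy_read, toy_critique,
)
```

Each `Step` record pairs a sub-question with its evidence, answer, and critique verdict; records of this kind are one plausible form for the self-generated, task-specific data that the TL;DR says is used for training across iterations.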
