Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding
Weizhi Fei, Xueyan Niu, Guoqing Xie, Yanhua Zhang, Bo Bai, Lei Deng, Wei Han
TL;DR
This work reframes long-context reasoning as interactive knowledge editing, enabling LLMs with fixed context windows to perform multi-hop reasoning by planning sub-questions and retrieving relevant context chunks within a DAG-based framework. It introduces two algorithms—Iterative QA with fact extraction and Knowledge-constrained decoding—built on planning and retrieval modules to update the model’s reasoning without parameter updates. Across long-context QA benchmarks and a synthetic variable-tracking task, the proposed approach outperforms fixed-window baselines and competitive long-context methods, with the knowledge-constrained decoding variant delivering the strongest results. The method offers a practical, plug-and-play path to enhance long-context reasoning on commodity hardware, albeit with limitations tied to dataset scope and prompt generalization, and it highlights the conceptual link between long-context reasoning and knowledge editing.
Abstract
Current Large Language Models (LLMs) face inherent limitations due to their pre-defined context lengths, which impede their capacity for multi-hop reasoning within extensive textual contexts. While existing techniques like Retrieval-Augmented Generation (RAG) have attempted to bridge this gap by sourcing external information, they fall short when direct answers are not readily available. We introduce a novel approach that re-imagines information retrieval through dynamic in-context editing, inspired by recent breakthroughs in knowledge editing. By treating lengthy contexts as malleable external knowledge, our method interactively gathers and integrates relevant information, thereby enabling LLMs to perform sophisticated reasoning steps. Experimental results demonstrate that our method effectively empowers context-limited LLMs, such as Llama2, to engage in multi-hop reasoning with improved performance, which outperforms state-of-the-art context window extrapolation methods and even compares favorably to more advanced commercial long-context models. Our interactive method not only enhances reasoning capabilities but also mitigates the associated training and computational costs, making it a pragmatic solution for enhancing LLMs' reasoning within expansive contexts.
