Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Sitao Cheng, Liangming Pan, Xunjian Yin, Xinyi Wang, William Yang Wang
TL;DR
The paper investigates how large language models balance their internal parametric knowledge (PK) with externally provided contextual knowledge (CK). By introducing EchoQA, a benchmark that spans scientific, factual, and commonsense domains, the authors categorize CK–PK interactions into four types and examine model behavior under progressively enforced reasoning instructions. Across complementary, conflicting, and irrelevant CK scenarios, they find universal PK suppression by CK and only partial recovery of PK leverage through instructions, revealing a reliability vulnerability in knowledge-intensive tasks. The work highlights factors such as knowledge type and entity popularity that modulate PK recall and suggests directions like agent-based recall-before-reasoning and post-training integration to improve PK–CK fusion. Overall, EchoQA provides a valuable testbed for understanding and enhancing the integration of PK and CK in modern LLMs.
Abstract
Large language models (LLMs) encode vast amounts of knowledge during pre-training (parametric knowledge, or PK) and can further be enhanced by incorporating contextual knowledge (CK). Can LLMs effectively integrate their internal PK with external CK to solve complex problems? In this paper, we investigate the dynamic interaction between PK and CK, categorizing their relationships into four types: Supportive, Complementary, Conflicting, and Irrelevant. To support this investigation, we introduce ECHOQA, a benchmark spanning scientific, factual, and commonsense knowledge. Our results show that LLMs tend to suppress their PK when contextual information is available, even when it is complementary or irrelevant. While tailored instructions can encourage LLMs to rely more on their PK, they still struggle to fully leverage it. These findings reveal a key vulnerability in LLMs, raising concerns about their reliability in knowledge-intensive tasks. Resources are available at https://github.com/sitaocheng/Knowledge_Interplay
