Table of Contents
Fetching ...

Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning

Jiashu He, Jinxuan Fan, Bowen Jiang, Ignacio Houine, Dan Roth, Alejandro Ribeiro

TL;DR

Self-GIVE is proposed, a retrieve-RL framework that enhances LLMs with automatic associative thinking through reinforcement learning, and enhances the scalable integration of structured retrieval and reasoning with associative thinking.

Abstract

When addressing complex questions that require new information, people often associate the question with existing knowledge to derive a sensible answer. For instance, when evaluating whether melatonin aids insomnia, one might associate "hormones helping mental disorders" with "melatonin being a hormone and insomnia a mental disorder" to complete the reasoning. Large Language Models (LLMs) also require such associative thinking, particularly in resolving scientific inquiries when retrieved knowledge is insufficient and does not directly answer the question. Graph Inspired Veracity Extrapolation (GIVE) addresses this by using a knowledge graph (KG) to extrapolate structured knowledge. However, it involves the construction and pruning of many hypothetical triplets, which limits efficiency and generalizability. We propose Self-GIVE, a retrieve-RL framework that enhances LLMs with automatic associative thinking through reinforcement learning. Self-GIVE extracts structured information and entity sets to assist the model in linking to the queried concepts. We address GIVE's key limitations: (1) extensive LLM calls and token overhead for knowledge extrapolation, (2) difficulty in deploying on smaller LLMs (3B or 7B) due to complex instructions, and (3) inaccurate knowledge from LLM pruning. Specifically, after fine-tuning using self-GIVE with a 135 node UMLS KG, it improves the performance of the Qwen2.5 3B and 7B models by up to $\textbf{28.5%$\rightarrow$71.4%}$ and $\textbf{78.6$\rightarrow$90.5%}$ in samples $\textbf{unseen}$ in challenging biomedical QA tasks. In particular, Self-GIVE allows the 7B model to match or outperform GPT3.5 turbo with GIVE, while cutting token usage by over 90%. Self-GIVE enhances the scalable integration of structured retrieval and reasoning with associative thinking.

Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning

TL;DR

Self-GIVE is proposed, a retrieve-RL framework that enhances LLMs with automatic associative thinking through reinforcement learning, and enhances the scalable integration of structured retrieval and reasoning with associative thinking.

Abstract

When addressing complex questions that require new information, people often associate the question with existing knowledge to derive a sensible answer. For instance, when evaluating whether melatonin aids insomnia, one might associate "hormones helping mental disorders" with "melatonin being a hormone and insomnia a mental disorder" to complete the reasoning. Large Language Models (LLMs) also require such associative thinking, particularly in resolving scientific inquiries when retrieved knowledge is insufficient and does not directly answer the question. Graph Inspired Veracity Extrapolation (GIVE) addresses this by using a knowledge graph (KG) to extrapolate structured knowledge. However, it involves the construction and pruning of many hypothetical triplets, which limits efficiency and generalizability. We propose Self-GIVE, a retrieve-RL framework that enhances LLMs with automatic associative thinking through reinforcement learning. Self-GIVE extracts structured information and entity sets to assist the model in linking to the queried concepts. We address GIVE's key limitations: (1) extensive LLM calls and token overhead for knowledge extrapolation, (2) difficulty in deploying on smaller LLMs (3B or 7B) due to complex instructions, and (3) inaccurate knowledge from LLM pruning. Specifically, after fine-tuning using self-GIVE with a 135 node UMLS KG, it improves the performance of the Qwen2.5 3B and 7B models by up to \rightarrow and \rightarrow in samples in challenging biomedical QA tasks. In particular, Self-GIVE allows the 7B model to match or outperform GPT3.5 turbo with GIVE, while cutting token usage by over 90%. Self-GIVE enhances the scalable integration of structured retrieval and reasoning with associative thinking.

Paper Structure

This paper contains 25 sections, 10 equations, 8 figures, 4 tables.

Figures (8)

  • Figure 1: Training/inference framework of Self-GIVE: For query $x$, we decompose it into key entities $\{e^k_x\}$, and utilize the semantically similar concepts in the knowledge base to construct entity groups $\mathbb{N}_x$; Based on these information, the model is fine-tuned to conduct associative thinking to solve the query using reinforcement learning with accuracy reward.
  • Figure 2: On challenging scientific questions, LLM falls short of parametric knowledge; RAG retrieves semantically similar but irrelevant information; GIVE yields to recessive inference time model calls and token consumption because of the construction and pruning of the hypothetical triplets; Self-GIVE provides a faithful and efficient reasoning process by teaching the model to construct additional triplets by itself and conduct associative thinking correctly.
  • Figure 3: GPT4.1 win rate comparison on PubmedQA "long answer" task. Reponses from each competing method is given to the scorer model with gold answer. We report the percentage to achieve the best score for each method.
  • Figure 4: QA accuracy of finetuned 3B-Instruct model with different no. additional entities per group
  • Figure 5: QA accuracy of Self-GIVE on questions with different no. knowledge triplets induced per entity group
  • ...and 3 more figures