RuAG: Learned-rule-augmented Generation for Large Language Models

Yudi Zhang; Pei Xiao; Lu Wang; Chaoyun Zhang; Meng Fang; Yali Du; Yevgeniy Puzyrev; Randolph Yao; Si Qin; Qingwei Lin; Mykola Pechenizkiy; Dongmei Zhang; Saravan Rajmohan; Qi Zhang

RuAG: Learned-rule-augmented Generation for Large Language Models

Yudi Zhang, Pei Xiao, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

TL;DR

A novel framework to automatically distill large volumes of offline data into interpretable first-order logic rules, which are injected into LLMs to boost their reasoning capabilities, and its effectiveness in enhancing LLM's capability over diverse tasks is evaluated.

Abstract

In-context learning (ICL) and Retrieval-Augmented Generation (RAG) have gained attention for their ability to enhance LLMs' reasoning by incorporating external knowledge but suffer from limited contextual window size, leading to insufficient information injection. To this end, we propose a novel framework, RuAG, to automatically distill large volumes of offline data into interpretable first-order logic rules, which are injected into LLMs to boost their reasoning capabilities. Our method begins by formulating the search process relying on LLMs' commonsense, where LLMs automatically define head and body predicates. Then, RuAG applies Monte Carlo Tree Search (MCTS) to address the combinational searching space and efficiently discover logic rules from data. The resulting logic rules are translated into natural language, allowing targeted knowledge injection and seamless integration into LLM prompts for LLM's downstream task reasoning. We evaluate our framework on public and private industrial tasks, including natural language processing, time-series, decision-making, and industrial tasks, demonstrating its effectiveness in enhancing LLM's capability over diverse tasks.

RuAG: Learned-rule-augmented Generation for Large Language Models

TL;DR

Abstract

RuAG: Learned-rule-augmented Generation for Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)