LGM: Enhancing Large Language Models with Conceptual Meta-Relations and Iterative Retrieval
Wenchang Lei, Ping Zou, Yue Wang, Feng Sun, Lei Zhao
TL;DR
The paper tackles the problem of LLMs misinterpreting ambiguous or misaligned terms by introducing the Language Graph Model (LGM), which uses meta-relations—inheritance, alias, and composition—to define concepts more clearly. It combines a dual-graph representation (SRG for syntactic structure and CRG for concept relations), a two-phase Learning and Concept Iterative Retrieval workflow, and a reflection mechanism to validate extracted relations, enabling long-text and multi-hop reasoning without context-window limits. Through Concept Expansion and parallel retrieval with iterative merging, LGM reduces noise from raw passages and bridges dependencies across hops. Empirical results on HotpotQA and Musique show consistent improvements over state-of-the-art RAG baselines, demonstrating strong benefits for concept-centric retrieval in complex QA tasks. The work also provides a suite of prompts and datasets to support meta-relation extraction, reflection validation, and LGM reasoning, contributing to scalable, explainable retrieval-augmented reasoning for large language models.
Abstract
Large language models (LLMs) exhibit strong semantic understanding, yet struggle when user instructions involve ambiguous or conceptually misaligned terms. We propose the Language Graph Model (LGM) to enhance conceptual clarity by extracting meta-relations-inheritance, alias, and composition-from natural language. The model further employs a reflection mechanism to validate these meta-relations. Leveraging a Concept Iterative Retrieval Algorithm, these relations and related descriptions are dynamically supplied to the LLM, improving its ability to interpret concepts and generate accurate responses. Unlike conventional Retrieval-Augmented Generation (RAG) approaches that rely on extended context windows, our method enables large language models to process texts of any length without the need for truncation. Experiments on standard benchmarks demonstrate that the LGM consistently outperforms existing RAG baselines.
