A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models

Utkarsh Sahu; Zhisheng Qi; Yongjia Lei; Ryan A. Rossi; Franck Dernoncourt; Nesreen K. Ahmed; Mahantesh M Halappanavar; Yao Ma; Yu Wang

TL;DR

This work proposes a graph-centric framework for understanding how knowledge is structurally organized in large language models. It defines knowledgeability at the triplet and entity levels, analyzes how these relate to graph properties such as node degree and homophily, and shows that entities exhibit strong neighborhood similarity, i.e., knowledgeability homophily. A graph neural network regression model uses local neighborhood information to predict entity knowledgeability, enabling active selection of informative triplets for fine-tuning LLMs; the resulting Graph-FT consistently outperforms random triplet selection. Grounded in the observed topological patterns of LLM knowledge, the findings suggest practical pathways for more efficient knowledge checking and targeted data curation, with potential extensions to knowledge graph retrieval and deficiency detection.

Abstract

Large language models have been extensively studied as neural knowledge bases for their knowledge access, editability, reasoning, and explainability. However, few works focus on the structural patterns of their knowledge. Motivated by this gap, we investigate these structural patterns from a graph perspective. We quantify the knowledge of LLMs at both the triplet and entity levels, and analyze how it relates to graph structural properties such as node degree. Furthermore, we uncover knowledge homophily, where topologically close entities exhibit similar levels of knowledgeability. This motivates us to develop graph machine learning models that estimate an entity's knowledgeability from its local neighbors. These models in turn enable more valuable knowledge checking by selecting triplets that are less known to LLMs. Empirical results show that using the selected triplets for fine-tuning leads to superior performance.
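To make the notions of entity-level knowledgeability and knowledge homophily concrete, the following is a minimal sketch and not the paper's actual formulation. It assumes each triplet already carries a score in [0, 1] for how well the LLM knows it, takes an entity's knowledgeability to be the mean score of the triplets it appears in, and measures homophily as one minus the mean absolute difference to its neighbors. The function names, the toy triplets, and both formulas are illustrative assumptions.

```python
from collections import defaultdict
from statistics import mean


def entity_knowledgeability(triplet_scores):
    """Assumed definition: mean score over all triplets touching an entity."""
    per_entity = defaultdict(list)
    for (head, _relation, tail), score in triplet_scores.items():
        per_entity[head].append(score)
        per_entity[tail].append(score)
    return {entity: mean(scores) for entity, scores in per_entity.items()}


def build_adjacency(triplet_scores):
    """Undirected neighbor sets derived from the triplets."""
    adjacency = defaultdict(set)
    for head, _relation, tail in triplet_scores:
        adjacency[head].add(tail)
        adjacency[tail].add(head)
    return adjacency


def knowledge_homophily(entity_scores, adjacency):
    """Assumed definition: 1 - mean |score(entity) - score(neighbor)|.

    Values near 1 mean an entity's neighbors are about as well known as it is.
    """
    return {
        entity: 1.0 - mean(abs(entity_scores[entity] - entity_scores[n]) for n in nbrs)
        for entity, nbrs in adjacency.items()
    }


# Hypothetical toy data: 1.0 = the LLM answered the triplet correctly, 0.0 = it did not.
toy_triplets = {
    ("Paris", "capital_of", "France"): 1.0,
    ("France", "member_of", "EU"): 1.0,
    ("Paris", "located_in", "Europe"): 0.0,
}

entity_scores = entity_knowledgeability(toy_triplets)
homophily = knowledge_homophily(entity_scores, build_adjacency(toy_triplets))
```

Under these assumptions, a model that predicts an entity's score from its neighbors' scores would be a natural next step when homophily is high; the abstract describes the use of graph machine learning models for that purpose.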
