When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected
Haotian Xu, Yuning You, Tengfei Ma
TL;DR
The paper interrogates whether explicit graph structure improves LLM-based graph reasoning, across text-attributed graphs and molecular graphs. Through systematic ablations of template-based encodings, GNN adapters, and various backbones, it finds that rich node semantics largely drive performance, while structural priors (e.g., Laplacian positional encodings, message passing) offer marginal or negative gains. This challenges the conventional emphasis on topology in graph learning and suggests a shift toward semantics-driven representations and node sequencing to leverage LLM capabilities. The findings hold across diverse tasks and datasets, including TAGs and molecular benchmarks, with implications for designing future graph foundation models that prioritize meaningful textual context over handcrafted structural cues.
Abstract
Graphs provide a unified representation of semantic content and relational structure, making them a natural fit for domains such as molecular modeling, citation networks, and social graphs. Meanwhile, large language models (LLMs) have excelled at understanding natural language and integrating cross-modal signals, sparking interest in their potential for graph reasoning. Recent work has explored this by either designing template-based graph templates or using graph neural networks (GNNs) to encode structural information. In this study, we investigate how different strategies for encoding graph structure affect LLM performance on text-attributed graphs. Surprisingly, our systematic experiments reveal that: (i) LLMs leveraging only node textual descriptions already achieve strong performance across tasks; and (ii) most structural encoding strategies offer marginal or even negative gains. We show that explicit structural priors are often unnecessary and, in some cases, counterproductive when powerful language models are involved. This represents a significant departure from traditional graph learning paradigms and highlights the need to rethink how structure should be represented and utilized in the LLM era. Our study is to systematically challenge the foundational assumption that structure is inherently beneficial for LLM-based graph reasoning, opening the door to new, semantics-driven approaches for graph learning.
