SR-LLM: Rethinking the Structured Representation in Large Language Model
Jiahuan Zhang, Tianheng Wang, Hanqing Wu, Ziyi Huang, Yulong Wu, Dongbai Chen, Linfeng Song, Yue Zhang, Guozheng Rao, Kaicheng Yu
TL;DR
SR-LLM investigates integrating structured representations (AMR, PST, FOL) with LLMs via training-free SR-NLD and training-dependent Gen-SR. It demonstrates that converting SR into natural language descriptions and fine-tuning on SR data can outperform code-based SR prompts, achieving notable gains on PAWS (+3.17% training-free, +12.38% training-dependent). The work provides a thorough empirical evaluation across 10 NLP tasks, including robustness checks with high-quality SR parsers and larger models, and shows SR-NLD improves weaker models more than strong ones. The findings suggest a path to improve LLM reasoning and interoperability through structured representations.
Abstract
Structured representations, exemplified by Abstract Meaning Representation (AMR), have long been pivotal in computational linguistics. However, their role remains ambiguous in the Large Language Models (LLMs) era. Initial attempts to integrate structured representation into LLMs via a zero-shot setting yielded inferior performance. We hypothesize that such a decline stems from the structure information being passed into LLMs in a code format unfamiliar to LLMs' training corpora. Consequently, we propose SR-LLM, an innovative framework with two settings to explore a superior way of integrating structured representation with LLMs from training-free and training-dependent perspectives. The former integrates structural information through natural language descriptions in LLM prompts, whereas its counterpart augments the model's inference capability through fine-tuning on linguistically described structured representations. Performance improvements were observed in widely downstream datasets, with particularly notable gains of 3.17% and 12.38% in PAWS. To the best of our knowledge, this work represents the pioneering demonstration that leveraging structural representations can substantially enhance LLMs' inference capability. We hope that our work sheds light and encourages future research to enhance the reasoning and interoperability of LLMs by structure data.
