GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question Answering
Qianlong Li, Chen Huang, Shuai Li, Yuanxin Xiang, Deng Xiong, Wenqiang Lei
TL;DR
GraphOTTER introduces an explicit graph-based reasoning framework for complex table QA, transforming tables into undirected graphs where both headers and data are nodes. It uses a Thought-Action-Update loop with a small set of intermediate actions to perform step-by-step reasoning, guided by a reasoning trace that records visited nodes. By focusing the reasoning on a graph-subset relevant to the question, GraphOTTER better filters irrelevant information and achieves state-of-the-art results across two benchmarks and two LLM backbones. The approach demonstrates notable gains over baselines and highlights the value of explicit graph reasoning and header inference in handling complex table structures. Future work includes extending to multi-turn scenarios and exploring more advanced graph actions to further improve efficiency and accuracy.
Abstract
Complex Table Question Answering involves providing accurate answers to specific questions based on intricate tables that exhibit complex layouts and flexible header locations. Despite considerable progress having been made in the LLM era, the reasoning processes of existing methods are often implicit, feeding the entire table into prompts, making it difficult to effectively filter out irrelevant information in the table. To this end, we propose GraphOTTER that explicitly establishes the reasoning process to pinpoint the correct answers. In particular, GraphOTTER leverages a graph-based representation, transforming the complex table into an undirected graph. It then conducts step-by-step reasoning on the graph, with each step guided by a set of pre-defined intermediate reasoning actions. As such, it constructs a clear reasoning path and effectively identifies the answer to a given question. Comprehensive experiments on two benchmark datasets and two LLM backbones demonstrate the effectiveness of GraphOTTER. Further analysis indicates that its success may be attributed to the ability to efficiently filter out irrelevant information, thereby focusing the reasoning process on the most pertinent data. Our code and experimental datasets are available at \url{https://github.com/JDing0521/GraphOTTER}.
