Hybrid Graphs for Table-and-Text based Question Answering using LLMs
Ankush Agarwal, Ganesh S, Chaitanya Devaguptapu
TL;DR
ODYSSEY tackles Table-Text QA in a zero-shot, fine-tuning-free setting by constructing a Hybrid Graph that unifies tables and their linked passages, then pruning the context through question-guided traversal. The method achieves state-of-the-art zero-shot performance on Hybrid-QA and OTT-QA across GPT-3.5, GPT-4, and LLaMA-3, with gains in Exact Match and F1 and input-token reductions of up to 53%. Key innovations include question-driven entity-header mapping, a multi-hop breadth-first search (BFS) traversal over the Hybrid Graph, and iterative reader prompts that feed compact, relevant context to the LLM. The work demonstrates that efficient cross-source reasoning is practical without fine-tuning and suggests directions for multi-modal extension and further efficiency improvements.
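To make the graph-and-traversal idea concrete, here is a minimal Python sketch of a hybrid graph over one table and its linked passages, pruned by a hop-limited BFS. It is an illustration under stated assumptions, not the authors' implementation: the node layout (`header`, `cell`, `passage` tuples), the function names `build_hybrid_graph` and `bfs_prune`, and the toy table are all hypothetical, and the start nodes are hand-picked here, whereas the summary describes a question-driven entity-header mapping step for choosing them.

```python
from collections import deque


def build_hybrid_graph(table, links):
    """Return an undirected adjacency map over header, cell, and passage nodes.

    Assumed (hypothetical) schema: `table` has "headers" and "rows";
    `links` maps a (row, col) position to the passages linked from that cell.
    """
    graph = {}

    def add_edge(a, b):
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)

    headers = table["headers"]
    for r, row in enumerate(table["rows"]):
        cells = [("cell", r, c, value) for c, value in enumerate(row)]
        for c, cell in enumerate(cells):
            add_edge(("header", headers[c]), cell)      # column membership
            for other in cells[c + 1:]:
                add_edge(cell, other)                   # same-row relation
            for passage in links.get((r, c), []):
                add_edge(cell, ("passage", passage))    # linked text
    return graph


def bfs_prune(graph, start_nodes, max_hops):
    """Keep only nodes reachable from the start nodes within `max_hops` edges."""
    start_nodes = list(start_nodes)
    seen = set(start_nodes)
    frontier = deque((node, 0) for node in start_nodes)
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop budget
        for neighbor in graph.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen


# Toy data (invented for illustration only).
table = {
    "headers": ["Player", "Team"],
    "rows": [["Alice", "Lions"], ["Bob", "Tigers"]],
}
links = {
    (0, 0): ["Alice is a striker born in 1990."],
    (1, 1): ["The Tigers play in Springfield."],
}
graph = build_hybrid_graph(table, links)

# Question: "Which team does Alice play for?"
# Assumed mapping: entity "Alice" -> its cell, target column -> header "Team".
start = [("cell", 0, 0, "Alice"), ("header", "Team")]
kept = bfs_prune(graph, start, max_hops=1)
```

With a one-hop budget, the retained context contains Alice's row and her linked passage but drops Bob's cell and the passage about the Tigers, which is the kind of pruning that shrinks the prompt handed to the reader LLM.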
Abstract
Answering questions that require reasoning and aggregation across both structured (tables) and unstructured (raw text) data sources presents significant challenges. Current methods rely on fine-tuning and on high-quality, human-curated data, which is difficult to obtain. Recent advances in Large Language Models (LLMs) have shown promising zero-shot results for multi-hop question answering (QA) over single-source text data, yet multi-source Table-Text QA remains largely unexplored. In this paper, we present a novel Hybrid Graph-based approach for Table-Text QA that leverages LLMs without fine-tuning. Our method constructs a unified Hybrid Graph from textual and tabular data and prunes it based on the input question, so that the LLM receives only concise, relevant context. We evaluate our approach on the challenging Hybrid-QA and OTT-QA datasets using state-of-the-art LLMs, including GPT-3.5, GPT-4, and LLaMA-3. Our method achieves the best zero-shot performance on both datasets, improving Exact Match scores by up to 10% on Hybrid-QA and 5.4% on OTT-QA. Moreover, our approach reduces token usage by up to 53% compared to the original context.
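For readers unfamiliar with the reported metrics, the sketch below shows how Exact Match and token-level F1 are commonly computed in QA evaluation. This summary does not specify the scoring script, so the SQuAD-style normalization (lowercasing, stripping punctuation and English articles) and the function names are assumptions made for illustration.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and articles, collapse whitespace (assumed rules)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, gold):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def token_f1(prediction, gold):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    # Multiset intersection counts each shared token at most as often as it
    # appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under these rules, a prediction of "Detroit Lions" against the gold answer "Lions" scores 0 on Exact Match but 2/3 on F1, which is why the two metrics are usually reported together.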
