H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
Nikhil Abhyankar, Vivek Gupta, Dan Roth, Chandan K. Reddy
TL;DR
H-STAR addresses the core challenge of tabular reasoning by integrating semantic language understanding with symbolic computation in a fixed two-stage pipeline. It first performs multi-view table extraction to produce a query-specific, compact table, then employs adaptive reasoning that uses semantic methods for direct lookups and lexical questions while invoking SQL-based reasoning for quantitative tasks, with SQL-derived evidence feeding the final textual reasoning. Across TabFact, WikiTQ, and FeTaQA, H-STAR consistently outperforms state-of-the-art baselines and demonstrates robustness across multiple LLMs, while ablation studies confirm the essential contribution of both the extraction and adaptive reasoning stages. The approach also delivers practical efficiency gains through targeted extraction, better handling of longer tables, and a manageable generation budget, suggesting strong potential for scalable, accurate tabular QA in real-world settings.
Abstract
Tabular reasoning involves interpreting natural language queries about tabular data, which presents a unique challenge of combining language understanding with structured data analysis. Existing methods employ either textual reasoning, which excels in semantic interpretation but struggles with mathematical operations, or symbolic reasoning, which handles computations well but lacks semantic understanding. This paper introduces a novel algorithm H-STAR that integrates both symbolic and semantic (textual) approaches in a two-stage process to address these limitations. H-STAR employs: (1) step-wise table extraction using `multi-view' column retrieval followed by row extraction, and (2) adaptive reasoning that adapts reasoning strategies based on question types, utilizing semantic reasoning for direct lookup and complex lexical queries while augmenting textual reasoning with symbolic reasoning support for quantitative and logical tasks. Our extensive experiments demonstrate that H-STAR significantly outperforms state-of-the-art methods across three tabular question-answering (QA) and fact-verification datasets, underscoring its effectiveness and efficiency.
