Automatic Logical Forms improve fidelity in Table-to-Text generation
Iñigo Alonso, Eneko Agirre
TL;DR
The paper tackles the fidelity gap in table-to-text generation by introducing TlT, a two-stage framework that, given a table and a content selection, first generates logical forms (LFs) automatically and then renders text from them. It shows that automatic LFs raise fidelity by about 30 points over a comparable system that does not use LFs. Automatic and human evaluations, ablations, and qualitative analyses rank the remaining challenges: automatic content selection comes first, followed by Logic-to-Text generation and, to a lesser extent, Table-to-Logic parsing. The work paves the way for practical, verifiable data-to-text systems and provides open-source resources to extend LF-based approaches to other structured inputs.
Abstract
Table-to-text systems generate natural language statements from structured data like tables. While end-to-end techniques suffer from low factual correctness (fidelity), a previous study reported gains when using manual logical forms (LF) that represent the selected content and the semantics of the target text. Given the manual step, it was not clear whether automatic LFs would be effective, or whether the improvement came from content selection alone. We present TlT, which, given a table and a selection of the content, first produces LFs and then the textual statement. We show for the first time that automatic LFs improve quality, with an increase in fidelity of 30 points over a comparable system not using LFs. Our experiments allow us to quantify the remaining challenges for high factual correctness, with automatic selection of content coming first, followed by better Logic-to-Text generation and, to a lesser extent, better Table-to-Logic parsing.
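To make the two-stage idea concrete, the following is a minimal sketch of a table, a logical form, and a statement rendered from it. It is not the TlT implementation. The table contents, the nested-tuple encoding, the operator names (`eq`, `hop`, `argmax`), and the functions `execute`, `render`, and `table_to_text` are all assumptions made for illustration. The template in `render` stands in for a learned Logic-to-Text model, and the hand-written candidate list stands in for a learned Table-to-Logic parser. Discarding candidates that evaluate to false is an illustrative choice here and is not claimed to be how TlT handles them.

```python
from typing import Any, Optional

# Toy table: hypothetical data, for illustration only.
TABLE = [
    {"player": "Ana", "points": 31},
    {"player": "Ben", "points": 24},
    {"player": "Cruz", "points": 18},
]

# A logical form encoded as a nested tuple: (operator, *arguments).
# Reads as: "the player in the row with the highest points is Ana".
TRUE_LF = ("eq", ("hop", ("argmax", "all_rows", "points"), "player"), "Ana")
FALSE_LF = ("eq", ("hop", ("argmax", "all_rows", "points"), "player"), "Ben")


def execute(lf: Any, table: list[dict]) -> Any:
    """Evaluate a logical form against the table."""
    if lf == "all_rows":
        return table
    if not isinstance(lf, tuple):
        return lf  # a literal value such as "Ana"
    op, *args = lf
    if op == "argmax":
        rows, column = execute(args[0], table), args[1]
        return max(rows, key=lambda row: row[column])
    if op == "hop":
        row, column = execute(args[0], table), args[1]
        return row[column]
    if op == "eq":
        return execute(args[0], table) == execute(args[1], table)
    raise ValueError(f"unknown operator: {op}")


def render(lf: tuple) -> str:
    """Template stand-in for a Logic-to-Text model (handles one LF shape only)."""
    _, (_, (_, _, column), target), value = lf
    return f"The {target} with the highest {column} is {value}."


def table_to_text(table: list[dict], candidate_lfs: list[tuple]) -> Optional[str]:
    """Return text for the first candidate LF that holds on the table."""
    for lf in candidate_lfs:
        if execute(lf, table) is True:
            return render(lf)
    return None


if __name__ == "__main__":
    print(table_to_text(TABLE, [FALSE_LF, TRUE_LF]))
    # prints: The player with the highest points is Ana.
```

Because the logical form can be executed against the table, a statement rendered from it can be checked for factual correctness before any text is produced, which is the intuition behind the verifiability of LF-based generation.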
