TASER: Table Agents for Schema-guided Extraction and Recommendation
Nicole Cho, Kirsty Fielding, William Watson, Sumitra Ganesh, Manuela Veloso
TL;DR
TASER tackles the challenge of extracting structured holdings data from highly heterogeneous, multi-page financial tables by employing a three-agent, schema-guided extraction framework. A Detector, Extractor, and Recommender operate in a recursive loop over an initial Portfolio schema, with the Recommender refining the schema and triggering re-extraction to improve fidelity. Empirical results show TASER outperforms the Table Transformer by 10.1% in detection and achieves superior dollar-value fidelity, aided by adaptive batch-size strategies that balance schema coverage, diversity, and precision. The work introduces TASERTab, a large real-world financial table dataset, and demonstrates the practical viability of continuous, schema-guided, agent-based extraction for complex regulatory documents.
Abstract
Real-world financial documents report essential information about an entity's financial holdings that can span millions of different financial instrument types. Yet, these details are often buried in messy, multi-page, fragmented tables - for example, 99.4% of the tables in our dataset have no bounding boxes with the maximum number of rows amounting to 426 per table across 44 pages. To tackle these unique challenges from real-world tables, we present a continuously learning, agentic table extraction system, TASER (Table Agents for Schema-guided Extraction and Recommendation) that extracts highly unstructured, multi-page, heterogeneous tables into normalized, schema-conforming outputs. Our table agents execute on table detection, classification, extraction, and recommendations by leveraging an initial schema. Then, our Recommender Agent reviews the outputs, recommends schema revisions, and decides on the final recommendations, enabling TASER to outperform existing table detection models such as Table Transformer by 10.1%. Within this continuous learning process, we highlight that larger batch sizes result in a 104.3% increase in schema recommendations that are actionable and utilized, resulting in a 9.8% increase in extracted holdings - highlighting the importance of a continuous learning process. To train TASER, we have manually labeled 22,584 pages (28,150,449 tokens), 3,213 tables for $731,685,511,687 of holdings culminating in one of the first real financial table datasets. We release our dataset TASERTab to enable the research community to access real-world financial tables and outputs. Our results highlight the promise of agentic, schema-guided extraction systems for robust understanding of real-world financial tables.
