Uncertainty Quantification in Table Structure Recognition
Kehinde Ajayi, Leizhen Zhang, Yi He, Jian Wu
TL;DR
This paper tackles uncertainty quantification in table structure recognition (TSR) by introducing a novel Test-Time Augmentation-based UQ pipeline, termed TTA-m, which trains multiple fine-tuned TSR models on augmented data and ensembles their predictions. Uncertainty is quantified through an ensemble mechanism that merges predictions using an IoU threshold $\theta_0$ and assigns a per-cell confidence score $\frac{c}{M+1}$, where $c$ is the number of contributing models. To enable evaluation without ground-truth uncertainty, the authors propose two proxies: masking (varying pixel intensities) and cell complexity (modeled via a cell adjacency graph). Evaluations on the ICDAR-19 TSR dataset with CascadeTabNet show that TTA-m improves cell-detection F1 and provides reliable confidence-based uncertainty estimates, albeit with higher computational cost. The approach advances practical TSR by enabling selective human review and lays groundwork for robust, uncertainty-aware document understanding systems, especially in scientific domains.
Abstract
Quantifying uncertainties for machine learning models is a critical step to reduce human verification effort by detecting predictions with low confidence. This paper proposes a method for uncertainty quantification (UQ) of table structure recognition (TSR). The proposed UQ method is built upon a mixture-of-expert approach termed Test-Time Augmentation (TTA). Our key idea is to enrich and diversify the table representations, to spotlight the cells with high recognition uncertainties. To evaluate the effectiveness, we proposed two heuristics to differentiate highly uncertain cells from normal cells, namely, masking and cell complexity quantification. Masking involves varying the pixel intensity to deem the detection uncertainty. Cell complexity quantification gauges the uncertainty of each cell by its topological relation with neighboring cells. The evaluation results based on standard benchmark datasets demonstrate that the proposed method is effective in quantifying uncertainty in TSR models. To our best knowledge, this study is the first of its kind to enable UQ in TSR tasks. Our code and data are available at: https://github.com/lamps-lab/UQTTA.git.
