Well-Conditioned Polynomial Representations for Mathematical Handwriting Recognition
Robert M. Corless, Deepak Singh Kalhan, Stephen M. Watt
TL;DR
The paper addresses the robust representation of handwritten mathematical strokes as parametric polynomials and systematically analyzes how the choice of basis (Legendre, Chebyshev, Legendre-Sobolev, Chebyshev-Sobolev) and polynomial degree affect conditioning, accuracy, and computational cost. It derives a Sobolev-norm bound $\|f - g\|_s \le \sqrt{n}\,(1 + \mu \|D\|)\,\|f - g\|_\infty$ and investigates the impact of the differentiation matrix on stability, supported by experiments on real handwriting data. Empirical results show that Sobolev bases significantly restrain coefficient growth and stabilize higher-degree representations, with Chebyshev-Sobolev yielding the best recognition accuracy (~97.5–98% around degree 12) at the cost of higher computation time. The work provides actionable guidance for selecting bases and degrees to balance efficiency and robustness in digital ink analysis and mathematical handwriting recognition.
Abstract
Previous work has made use of a parameterized plane curve polynomial representation for mathematical handwriting, with the polynomials represented in a Legendre or Legendre-Sobolev graded basis. This provides a compact geometric representation for the digital ink. Preliminary results have also been shown for Chebyshev and Chebyshev-Sobolev bases. This article explores the trade-offs between basis choice and polynomial degree to achieve accurate modeling with a low computational cost. To do this, we consider the condition number for polynomial evaluation in these bases and bound how the various inner products give norms for the variations between symbols.
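
As an illustration of the representation described in the abstract, the following minimal sketch fits a parameterized plane curve with truncated Legendre series and compares the conditioning of the least-squares design matrices in several bases. It is not the authors' implementation: the synthetic stroke, the helper name `fit_stroke`, the uniform parameterization on [-1, 1], and the choice of degree 12 are all assumptions made for the example. The matrix condition number computed here is related to, but not the same as, the condition number for polynomial evaluation that the article studies, and the Sobolev variants are not covered.

```python
import numpy as np
from numpy.polynomial import legendre, chebyshev


def fit_stroke(x, y, degree, basis="legendre"):
    """Least-squares fit of a stroke (x(t), y(t)), with t spread uniformly on [-1, 1].

    Hypothetical helper for illustration; returns the parameter values and
    the coefficient vectors of x(t) and y(t) in the chosen basis.
    """
    t = np.linspace(-1.0, 1.0, len(x))
    fit = legendre.legfit if basis == "legendre" else chebyshev.chebfit
    return t, fit(t, x, degree), fit(t, y, degree)


# Synthetic stand-in for a sampled pen stroke (assumption: not real ink data).
s = np.linspace(0.0, 1.0, 200)
x = np.cos(2 * np.pi * s) + 0.3 * s
y = np.sin(2 * np.pi * s)

degree = 12
t, cx, cy = fit_stroke(x, y, degree)

# Reconstruct the curve from its 2 * (degree + 1) coefficients.
x_hat = legendre.legval(t, cx)
y_hat = legendre.legval(t, cy)
max_err = max(np.max(np.abs(x - x_hat)), np.max(np.abs(y - y_hat)))

# Condition numbers of the design matrices for the same sample points.
cond_leg = np.linalg.cond(legendre.legvander(t, degree))
cond_cheb = np.linalg.cond(chebyshev.chebvander(t, degree))
cond_mono = np.linalg.cond(np.vander(t, degree + 1))

print(f"max reconstruction error: {max_err:.2e}")
print(f"cond: Legendre {cond_leg:.2e}, Chebyshev {cond_cheb:.2e}, monomial {cond_mono:.2e}")
```

The coefficient vectors `cx` and `cy` together form the compact description of the stroke. On this example the orthogonal bases give much smaller design-matrix condition numbers than the monomial basis, which is the kind of basis-dependent behavior the article examines.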
