A Study of PHOC Spatial Region Configurations for Math Formula Retrieval
Matt Langsenkamp, Bryan Amador, Richard Zanibbi
TL;DR
The paper tackles visual retrieval of mathematical formulas using symbol-label-and-location representations (PHOC) and investigates whether higher PHOC levels add unique information by introducing concentric rectangles as a new region type and by skipping PHOC levels. It extends XY-PHOC with concentric rectangles and analyzes level-skipping (odd levels and last-level only) on the ARQMath-3 benchmark, comparing against a Bag-of-Words baseline. Findings show that rectangular PHOC variants outperform earlier PHOC configurations and that some higher levels are redundant for retrieval, while PHOC-based methods remain competitive with state-of-the-art despite their simplicity. The work highlights practical trade-offs between precision and ranking depth, and suggests future integration with neural embeddings and cross-domain graphic retrieval.
Abstract
A Pyramidal Histogram Of Characters (PHOC) represents the spatial location of symbols as binary vectors. The vectors are composed of levels that split a formula into equal-sized regions of one or more types (e.g., rectangles or ellipses). For each region type, this produces a pyramid of overlapping regions, where the first level contains the entire formula, and the final level the finest-grained regions. In this work, we introduce concentric rectangles for regions, and analyze whether subsequent PHOC levels encode redundant information by omitting levels from PHOC configurations. As a baseline, we include a bag of words PHOC containing only the first whole-formula level. Finally, using the ARQMath-3 formula retrieval benchmark, we demonstrate that some levels encoded in the original PHOC configurations are redundant, that PHOC models with rectangular regions outperform earlier PHOC models, and that despite their simplicity, PHOC models are surprisingly competitive with the state-of-the-art. PHOC is not math-specific, and might be used for chemical diagrams, charts, or other graphics.
