Billet Number Recognition Based on Test-Time Adaptation
Yuan Wei, Xiuzhuang Zhou
TL;DR
The paper tackles real-time billet number recognition under production conditions, where distribution shifts and damaged characters hinder accuracy. It introduces a two-pronged approach: (i) test-time adaptation by minimizing entropy $H(oldsymbol{ ilde{y}})$ during testing to adapt a DB+SVTR pipeline without labels, updating only BatchNorm parameters, and (ii) inference-time priors based on billet-number encoding rules and a damaged-character correction for CTC to enforce valid outputs and recover from missing characters. The method uses Differentiable Binarization (DB) for detection and SVTR for recognition, with post-processing that enforces encoding constraints and improves CTC handling of blanks in damaged regions. Experiments on machine-printed and handwritten billets show substantial gains, with accuracy improving from baseline 0.58 to 0.79 on machine-printed and from 0.23 to 0.70 on handwritten numbers when both TTA and prior-knowledge constraints are employed, demonstrating practical impact in industrial settings.
Abstract
During the steel billet production process, it is essential to recognize machine-printed or manually written billet numbers on moving billets in real-time. To address the issue of low recognition accuracy for existing scene text recognition methods, caused by factors such as image distortions and distribution differences between training and test data, we propose a billet number recognition method that integrates test-time adaptation with prior knowledge. First, we introduce a test-time adaptation method into a model that uses the DB network for text detection and the SVTR network for text recognition. By minimizing the model's entropy during the testing phase, the model can adapt to the distribution of test data without the need for supervised fine-tuning. Second, we leverage the billet number encoding rules as prior knowledge to assess the validity of each recognition result. Invalid results, which do not comply with the encoding rules, are replaced. Finally, we introduce a validation mechanism into the CTC algorithm using prior knowledge to address its limitations in recognizing damaged characters. Experimental results on real datasets, including both machine-printed billet numbers and handwritten billet numbers, show significant improvements in evaluation metrics, validating the effectiveness of the proposed method.
