Guidelines for External Disturbance Factors in the Use of OCR in Real-World Environments
Kenji Iwata, Eiki Ishidera, Toshifumi Yamaai, Yutaka Satoh, Hiroshi Tanaka, Katsuhiko Takahashi, Akio Furuhata, Yoshihisa Tanabe, Hiroshi Matsumura
TL;DR
Addressing OCR degradation from real-world disturbances, the paper builds a comprehensive external-disturbance factor table and associated guidelines. It defines non-uniform illumination levels and OCR grades, and catalogs illumination, obstacles, objects, camera/photographer, and scene factors with mappings to degradation phenomena. It then presents practical utilization methods: data augmentation, and reconfirmation workflows for users, to diagnose and mitigate issues. The resulting framework enables systematic quality control and more reliable OCR deployment in real-world environments.
Abstract
The performance of OCR has improved with the evolution of AI technology. As OCR continues to broaden its range of applications, the increased likelihood of interference introduced by various usage environments can prevent it from achieving its inherent performance. This results in reduced recognition accuracy under certain conditions, and makes the quality control of recognition devices more challenging. Therefore, to ensure that users can properly utilize OCR, we compiled the real-world external disturbance factors that cause performance degradation, along with the resulting image degradation phenomena, into an external disturbance factor table and, by also indicating how to make use of it, organized them into guidelines.
