Towards Calibration Enhanced Network by Inverse Adversarial Attack
Yupeng Cheng, Zi Pong Lim, Sarthak Ketanbhai Modi, Yon Shin Teo, Yushi Cao, Shang-Wei Lin
TL;DR
This work tackles miscalibration in OCR-based HMI validation by introducing Inverse Adversarial Attack (IAA), which generates underconfident yet correctly classified examples, and Inverse Adversarial Training (IAT) to defend against such underconfidence. The authors design a two-term cross-entropy loss for IAA and derive its optimization properties, showing how the ground-truth probability can be driven toward underconfidence under controlled perturbations. Through extensive experiments on CIFAR-10 and qualitative NeurIPS data, they demonstrate that IAA effectively reveals underconfidence, and that integrating IAEs via IAT, especially when combined with standard adversarial training (AT), yields models that are both robust and well-calibrated. The work also discusses transferability, showing strong whitebox effects but limited cross-model transfer, and suggests future directions for enhancing cross-architecture applicability.
Abstract
Test automation has become increasingly important as the complexity of both design and content in Human Machine Interface (HMI) software continues to grow. Current standard practice uses Optical Character Recognition (OCR) techniques to automatically extract textual information from HMI screens for validation. At present, one of the key challenges faced during the automation of HMI screen validation is the noise handling for the OCR models. In this paper, we propose to utilize adversarial training techniques to enhance OCR models in HMI testing scenarios. More specifically, we design a new adversarial attack objective for OCR models to discover the decision boundaries in the context of HMI testing. We then adopt adversarial training to optimize the decision boundaries towards a more robust and accurate OCR model. In addition, we also built an HMI screen dataset based on real-world requirements and applied multiple types of perturbation onto the clean HMI dataset to provide a more complete coverage for the potential scenarios. We conduct experiments to demonstrate how using adversarial training techniques yields more robust OCR models against various kinds of noises, while still maintaining high OCR model accuracy. Further experiments even demonstrate that the adversarial training models exhibit a certain degree of robustness against perturbations from other patterns.
