AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images
Prithvijit Chattopadhyay, Bharat Goyal, Boglarka Ecsedi, Viraj Prabhu, Judy Hoffman
TL;DR
AUGCAL addresses the persistent miscalibration and overconfidence seen in Sim2Real adaptations by introducing a training-time patch that combines strong synthetic augmentations with a calibration-focused loss on augmented predictions. By augmenting source (synthetic) images with Aug transformations (e.g., Pasta, RandAugment) and optimizing a calibration loss (e.g., DCA) alongside standard UDA objectives, AUGCAL tightens an upper bound on target calibration error that comprises both domain divergence and source calibration terms. Empirically, AUGCAL improves calibration metrics (ECE, IC-ECE, OC) and reliability (PRR) across semantic segmentation and object recognition benchmarks, while preserving or enhancing transfer performance across multiple base methods (Entropy Minimization, HRDA, SDAT) and backbones (CNNs and Transformers). The approach is lightweight, task-agnostic, and demonstrates practical impact for deploying more reliable Sim2Real models in real-world settings.
Abstract
Synthetic data (SIM) drawn from simulators have emerged as a popular alternative for training models where acquiring annotated real-world images is difficult. However, transferring models trained on synthetic images to real-world applications can be challenging due to appearance disparities. A commonly employed solution to counter this SIM2REAL gap is unsupervised domain adaptation, where models are trained using labeled SIM data and unlabeled REAL data. Mispredictions made by such SIM2REAL adapted models are often associated with miscalibration - stemming from overconfident predictions on real data. In this paper, we introduce AUGCAL, a simple training-time patch for unsupervised adaptation that improves SIM2REAL adapted models by - (1) reducing overall miscalibration, (2) reducing overconfidence in incorrect predictions and (3) improving confidence score reliability by better guiding misclassification detection - all while retaining or improving SIM2REAL performance. Given a base SIM2REAL adaptation algorithm, at training time, AUGCAL involves replacing vanilla SIM images with strongly augmented views (AUG intervention) and additionally optimizing for a training time calibration loss on augmented SIM predictions (CAL intervention). We motivate AUGCAL using a brief analytical justification of how to reduce miscalibration on unlabeled REAL data. Through our experiments, we empirically show the efficacy of AUGCAL across multiple adaptation methods, backbones, tasks and shifts.
