Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model
Claudia Cuttano, Antonio Tavera, Fabio Cermelli, Giuseppe Averta, Barbara Caputo
TL;DR
The paper addresses the practical challenge of transferring knowledge from a black-box semantic segmentation model to a lightweight target model using only unlabeled target data, without access to source data. It introduces CoRTe, which combines Robust Relative Confidence Pseudo-Labelling to extract reliable pseudo-labels, a target-domain pseudo-label refinement via an EMA-based teacher, and a consistency-regularized training scheme with strong augmentations. Empirical results on GTA5→Cityscapes and SYNTHIA→Cityscapes show CoRTe outperforming baseline black-box methods and approaching or matching the Target-only upper bound in several cases, illustrating effective domain transfer under restrictive data access. The approach enables reliable, efficient deployment of segmentation models on edge devices where source data and transparent models are not accessible, with practical impact for privacy-preserving and commercial settings.
Abstract
Many practical applications require training of semantic segmentation models on unlabelled datasets and their execution on low-resource hardware. Distillation from a trained source model may represent a solution for the first but does not account for the different distribution of the training data. Unsupervised domain adaptation (UDA) techniques claim to solve the domain shift, but in most cases assume the availability of the source data or an accessible white-box source model, which in practical applications are often unavailable for commercial and/or safety reasons. In this paper, we investigate a more challenging setting in which a lightweight model has to be trained on a target unlabelled dataset for semantic segmentation, under the assumption that we have access only to black-box source model predictions. Our method, named CoRTe, consists of (i) a pseudo-labelling function that extracts reliable knowledge from the black-box source model using its relative confidence, (ii) a pseudo label refinement method to retain and enhance the novel information learned by the student model on the target data, and (iii) a consistent training of the model using the extracted pseudo labels. We benchmark CoRTe on two synthetic-to-real settings, demonstrating remarkable results when using black-box models to transfer knowledge on lightweight models for a target data distribution.
