Experts-Guided Unbalanced Optimal Transport for ISP Learning from Unpaired and/or Paired Data
Georgy Perevozchikov, Nancy Mehta, Egor Ershov, Radu Timofte
TL;DR
This work tackles the reliance on costly paired raw-to-sRGB data by introducing Experts-Guided Unbalanced Optimal Transport (EGUOT) for ISP learning. EGUOT couples Unbalanced OT with a Committee of Expert Discriminators to guide a transport-based raw-to-sRGB mapping, enabling effective training in both unpaired and paired settings and offering robustness to dataset outliers. Across three diverse datasets, EGUOT achieves state-of-the-art unpaired performance and consistently improves or matches paired baselines, demonstrating architecture-agnostic adaptability. Ablation studies confirm the necessity of the UOT objective and the expert committee, highlighting improved color fidelity, texture, and artifact suppression, with practical implications for scalable ISP development.
Abstract
Learned Image Signal Processing (ISP) pipelines offer powerful end-to-end performance but are critically dependent on large-scale paired raw-to-sRGB datasets. This reliance on costly-to-acquire paired data remains a significant bottleneck. To address this challenge, we introduce a novel, unsupervised training framework based on Optimal Transport capable of training arbitrary ISP architectures in both unpaired and paired modes. We are the first to successfully apply Unbalanced Optimal Transport (UOT) for this complex, cross-domain translation task. Our UOT-based framework provides robustness to outliers in the target sRGB data, allowing it to discount atypical samples that would be prohibitively costly to map. A key component of our framework is a novel ``committee of expert discriminators,'' a hybrid adversarial regularizer. This committee guides the optimal transport mapping by providing specialized, targeted gradients to correct specific ISP failure modes, including color fidelity, structural artifacts, and frequency-domain realism. To demonstrate the superiority of our approach, we retrained existing state-of-the-art ISP architectures using our paired and unpaired setups. Our experiments show that while our framework, when trained in paired mode, exceeds the performance of the original paired methods across all metrics, our unpaired mode concurrently achieves quantitative and qualitative performance that rivals, and in some cases surpasses, the original paired-trained counterparts. The code and pre-trained models are available at: https://github.com/gosha20777/EGUOT-ISP.git.
