Transfer Learning of Surrogate Models: Integrating Domain Warping and Affine Transformations

Shuaiqun Pan; Diederick Vermetten; Manuel López-Ibáñez; Thomas Bäck; Hao Wang

TL;DR

The work addresses transferring surrogate models across tasks under nonlinear covariate shifts by jointly learning a nonlinear input warp implemented via per-dimension beta CDFs and an affine transformation, formalized as $f^{\mathrm{T}}(x)=f^{\mathrm{S}}(g(x))$ with $g(x)=W\phi(x)+v$ and $W\in SO(d)$. It provides both differentiable (with Riemannian gradient on $SO(d)$) and non-differentiable (via CMA-ES with a Lie algebra parameterization) pathways to fit the transformation using a small transfer dataset and empirical loss. Empirical results on the Black-Box Optimization Benchmark (BBOB) and an automotive ABS task show data-efficient gains in low-data regimes, particularly in higher dimensions, though the advantage declines as transfer data increases and on highly multimodal landscapes. The approach demonstrates practical potential for data-efficient surrogate reuse, with promising directions including active learning, extension to other regressors, and exploring faster warp alternatives like Kumaraswamy warping.
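To make the composition $f^{\mathrm{T}}(x)=f^{\mathrm{S}}(W\phi(x)+v)$ concrete, here is a minimal Python sketch rather than the authors' implementation. It assumes inputs normalized to $[0,1]^d$, uses SciPy's beta CDF for the per-dimension warp $\phi$, and builds $W\in SO(d)$ as the matrix exponential of a skew-symmetric matrix (one possible Lie algebra parameterization). The function names and the toy source function `f_source` are hypothetical.

```python
import numpy as np
from scipy.linalg import expm
from scipy.stats import beta


def rotation_from_lie(theta, d):
    """Map d(d-1)/2 free parameters to a rotation in SO(d) via the matrix exponential."""
    A = np.zeros((d, d))
    A[np.triu_indices(d, k=1)] = theta  # fill the strict upper triangle
    A = A - A.T                         # make the generator skew-symmetric
    return expm(A)                      # exp of a skew-symmetric matrix lies in SO(d)


def warp(X, a, b):
    """Per-dimension beta-CDF warp phi; assumes X lies in [0, 1]^d."""
    return beta.cdf(X, a, b)            # a, b of shape (d,) broadcast over rows of X


def g(X, a, b, theta, v):
    """Combined transformation g(x) = W phi(x) + v, applied row-wise."""
    W = rotation_from_lie(theta, X.shape[1])
    return warp(X, a, b) @ W.T + v


def f_source(Z):
    """Hypothetical stand-in for a pre-trained source surrogate."""
    return np.sum((Z - 0.5) ** 2, axis=1)


def f_target_model(X, a, b, theta, v):
    """Transferred surrogate: the source model evaluated on transformed inputs."""
    return f_source(g(X, a, b, theta, v))
```

With `a = b = 1` the beta CDF is the identity on $[0,1]$, and with `theta = 0` and `v = 0` the affine part is the identity as well, so the transferred model then coincides with the source model.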

Abstract

Surrogate models provide efficient alternatives to computationally demanding real-world processes but often require large datasets for effective training. A promising solution to this limitation is the transfer of pre-trained surrogate models to new tasks. Previous studies have investigated the transfer of differentiable and non-differentiable surrogate models, typically assuming an affine transformation between the source and target functions. This paper extends previous research by addressing a broader range of transformations, including linear and nonlinear variations. Specifically, we consider the combination of an unknown input warping, such as one modeled by the beta cumulative distribution function, with an unspecified affine transformation. Our approach achieves transfer learning by using a limited number of data points from the target task to optimize these transformations, minimizing the empirical loss on the transfer dataset. We validate the proposed method on the widely used Black-Box Optimization Benchmark (BBOB) testbed and on a real-world transfer learning task from the automobile industry. The results show that the transferred surrogate significantly outperforms both the original surrogate and one built from scratch on the transfer dataset, particularly in data-scarce scenarios.
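The fitting step described above, optimizing the warp and affine parameters against a small transfer dataset, can be sketched as follows. This is an illustration under stated assumptions, not the paper's procedure: the source surrogate is a hypothetical toy function, the target data are synthetic, the dimension is fixed to $d=2$, and SciPy's derivative-free Nelder-Mead optimizer stands in for the Riemannian-gradient and CMA-ES pathways.

```python
import numpy as np
from scipy.linalg import expm
from scipy.optimize import minimize
from scipy.stats import beta

d = 2  # this sketch hard-codes a single rotation angle, so it only covers d = 2


def f_source(Z):
    # Hypothetical stand-in for a pre-trained source surrogate: a shifted sphere.
    return np.sum((Z - 0.5) ** 2, axis=1)


def unpack(p):
    # Log-parameters keep the beta shapes positive; clipping avoids overflow.
    a = np.exp(np.clip(p[:d], -3, 3))
    b = np.exp(np.clip(p[d:2 * d], -3, 3))
    theta = p[2 * d]        # rotation angle (Lie algebra coordinate of SO(2))
    v = p[2 * d + 1:]       # translation vector
    return a, b, theta, v


def transferred(X, p):
    # Evaluate f_source(W phi(x) + v) for each row x of X.
    a, b, theta, v = unpack(p)
    A = np.array([[0.0, theta], [-theta, 0.0]])  # skew-symmetric generator
    W = expm(A)
    return f_source(beta.cdf(X, a, b) @ W.T + v)


# Synthetic transfer dataset: 20 target evaluations generated from known parameters.
rng = np.random.default_rng(0)
X_transfer = rng.uniform(size=(20, d))
p_true = np.array([np.log(2.0), np.log(0.5), np.log(1.5), np.log(1.0), 0.4, 0.1, -0.2])
y_transfer = transferred(X_transfer, p_true)


def empirical_loss(p):
    # Mean squared error of the transferred surrogate on the transfer dataset.
    return np.mean((transferred(X_transfer, p) - y_transfer) ** 2)


p0 = np.zeros(3 * d + 1)       # identity warp, no rotation, no shift
loss0 = empirical_loss(p0)     # loss of the untransferred source surrogate
res = minimize(empirical_loss, p0, method="Nelder-Mead", options={"maxiter": 2000})
print(f"loss before transfer: {loss0:.4g}, after: {res.fun:.4g}")
```

Starting from the identity transformation means the optimizer begins at the original source surrogate, so the fitted loss on the transfer dataset cannot end up worse than the untransferred baseline.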
