A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning

Qingyue Zhang; Haohao Fu; Guanbo Huang; Yaoyuan Liang; Chang Chu; Tianren Peng; Yanru Wu; Qi Li; Yang Li; Shao-Lun Huang

A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning

Qingyue Zhang, Haohao Fu, Guanbo Huang, Yaoyuan Liang, Chang Chu, Tianren Peng, Yanru Wu, Qi Li, Yang Li, Shao-Lun Huang

TL;DR

This work tackles how many samples to transfer from each source task in multi-source transfer learning by introducing a KL-divergence-based generalization error and analyzing it with high-dimensional statistics. It derives explicit optimal transfer-quantity formulas for single- and multi-source settings and presents OTQMS, an architecture-agnostic, data-efficient training algorithm with a dynamic sampling strategy guided by the Fisher information. Theoretical results are complemented by extensive experiments on DomainNet, Office-Home, and Digits, where OTQMS achieves higher accuracy and substantial data-time savings compared with baselines. The approach broadens the applicability of multi-source transfer by enabling shot-general, domain-aware transfer quantity optimization with practical, scalable training. Overall, OTQMS demonstrates that carefully selecting transfer quantities rather than exhaustively using all source data yields meaningful gains in both performance and efficiency in real-world, few-shot transfer learning scenarios.

Abstract

Multi-source transfer learning provides an effective solution to data scarcity in real-world supervised learning scenarios by leveraging multiple source tasks. In this field, existing works typically use all available samples from sources in training, which constrains their training efficiency and may lead to suboptimal results. To address this, we propose a theoretical framework that answers the question: what is the optimal quantity of source samples needed from each source task to jointly train the target model? Specifically, we introduce a generalization error measure based on K-L divergence, and minimize it based on high-dimensional statistical analysis to determine the optimal transfer quantity for each source task. Additionally, we develop an architecture-agnostic and data-efficient algorithm OTQMS to implement our theoretical results for target model training in multi-source transfer learning. Experimental studies on diverse architectures and two real-world benchmark datasets show that our proposed algorithm significantly outperforms state-of-the-art approaches in both accuracy and data efficiency. The code and supplementary materials are available in https://github.com/zqy0126/OTQMS.

A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning

TL;DR

Abstract

A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (15)