Exploring Transferability of Self-Supervised Learning by Task Conflict Calibration

Huijie Guo; Jingyao Wang; Peizheng Guo; Xingchen Shen; Changwen Zheng; Wenwen Qiang

Exploring Transferability of Self-Supervised Learning by Task Conflict Calibration

Huijie Guo, Jingyao Wang, Peizheng Guo, Xingchen Shen, Changwen Zheng, Wenwen Qiang

TL;DR

This work addresses the transferability of self-supervised representations and identifies task conflict as a key barrier when SSL is trained with multiple tasks per batch. It introduces Task Conflict Calibration (TC^2), which constructs multiple SSL tasks within each batch and learns a shared factor basis $V$ and per-sample weights $W_j=f_w(z_j)V^T$ to isolate task-relevant semantics via a two-stage bi-level optimization. The method enforces factor orthogonality and data reconstruction through $\mathcal{L}_v$ and promotes sparsity in sample-factor weights via $\mathcal{L}_w$, combining them into $\mathcal{L}_{tc^2}=\mathcal{L}_v+\lambda_s\mathcal{L}_w$; TC^2 is then optimized in a two-stage process to maximize transferability while suppressing inter-task interference. Empirical results across transfer learning, video-based tasks, and OOD benchmarks show consistent improvements over strong SSL baselines, validating the approach and underscoring the importance of modeling task distributions within SSL pipelines. Overall, TC^2 provides a practical, plug-in mechanism to improve SSL transferability by calibrating task-level representations and mitigating task conflicts.

Abstract

In this paper, we explore the transferability of SSL by addressing two central questions: (i) what is the representation transferability of SSL, and (ii) how can we effectively model this transferability? Transferability is defined as the ability of a representation learned from one task to support the objective of another. Inspired by the meta-learning paradigm, we construct multiple SSL tasks within each training batch to support explicitly modeling transferability. Based on empirical evidence and causal analysis, we find that although introducing task-level information improves transferability, it is still hindered by task conflict. To address this issue, we propose a Task Conflict Calibration (TC$^2$) method to alleviate the impact of task conflict. Specifically, it first splits batches to create multiple SSL tasks, infusing task-level information. Next, it uses a factor extraction network to produce causal generative factors for all tasks and a weight extraction network to assign dedicated weights to each sample, employing data reconstruction, orthogonality, and sparsity to ensure effectiveness. Finally, TC$^2$ calibrates sample representations during SSL training and integrates into the pipeline via a two-stage bi-level optimization framework to boost the transferability of learned representations. Experimental results on multiple downstream tasks demonstrate that our method consistently improves the transferability of SSL models.

Exploring Transferability of Self-Supervised Learning by Task Conflict Calibration

TL;DR

and per-sample weights

to isolate task-relevant semantics via a two-stage bi-level optimization. The method enforces factor orthogonality and data reconstruction through

and promotes sparsity in sample-factor weights via

, combining them into

; TC^2 is then optimized in a two-stage process to maximize transferability while suppressing inter-task interference. Empirical results across transfer learning, video-based tasks, and OOD benchmarks show consistent improvements over strong SSL baselines, validating the approach and underscoring the importance of modeling task distributions within SSL pipelines. Overall, TC^2 provides a practical, plug-in mechanism to improve SSL transferability by calibrating task-level representations and mitigating task conflicts.

Abstract

) method to alleviate the impact of task conflict. Specifically, it first splits batches to create multiple SSL tasks, infusing task-level information. Next, it uses a factor extraction network to produce causal generative factors for all tasks and a weight extraction network to assign dedicated weights to each sample, employing data reconstruction, orthogonality, and sparsity to ensure effectiveness. Finally, TC

calibrates sample representations during SSL training and integrates into the pipeline via a two-stage bi-level optimization framework to boost the transferability of learned representations. Experimental results on multiple downstream tasks demonstrate that our method consistently improves the transferability of SSL models.

Exploring Transferability of Self-Supervised Learning by Task Conflict Calibration

TL;DR

Abstract

Exploring Transferability of Self-Supervised Learning by Task Conflict Calibration

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (3)