TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
Seungmin Baek, Soyul Lee, Hayeon Jo, Hyesong Choi, Dongbo Min
TL;DR
TADFormer tackles the efficiency challenge of multi-task learning in vision by introducing task prompts and a Dynamic Task Filter (DTF) within a task-adaptive Transformer framework. The approach combines a parameter-efficient shared TS-Module with a Task-Prompt Conditional (TPC) operator and a TA-Module that uses DTF to generate input-context-aware, task-specific features. Quantitative results on PASCAL-Context show superior accuracy and up to 8.4x fewer trainable parameters than full fine-tuning, with further gains when using larger backbones or alternative decoders. The method also proves compatible with adapter-based PEFT schemes, underscoring its flexibility and practical impact for scalable, multi-task dense prediction.
Abstract
Transfer learning paradigm has driven substantial advancements in various vision tasks. However, as state-of-the-art models continue to grow, classical full fine-tuning often becomes computationally impractical, particularly in multi-task learning (MTL) setup where training complexity increases proportional to the number of tasks. Consequently, recent studies have explored Parameter-Efficient Fine-Tuning (PEFT) for MTL architectures. Despite some progress, these approaches still exhibit limitations in capturing fine-grained, task-specific features that are crucial to MTL. In this paper, we introduce Task-Adaptive Dynamic transFormer, termed TADFormer, a novel PEFT framework that performs task-aware feature adaptation in the fine-grained manner by dynamically considering task-specific input contexts. TADFormer proposes the parameter-efficient prompting for task adaptation and the Dynamic Task Filter (DTF) to capture task information conditioned on input contexts. Experiments on the PASCAL-Context benchmark demonstrate that the proposed method achieves higher accuracy in dense scene understanding tasks, while reducing the number of trainable parameters by up to 8.4 times when compared to full fine-tuning of MTL models. TADFormer also demonstrates superior parameter efficiency and accuracy compared to recent PEFT methods.
