Unprejudiced Training Auxiliary Tasks Makes Primary Better: A Multi-Task Learning Perspective
Yuanze Li, Chun-Mei Feng, Qilong Wang, Guanglei Yang, Wangmeng Zuo
TL;DR
The paper tackles the problem that auxiliary tasks in multi-task learning are often under-trained, limiting their benefit to the primary task. It proposes Impartial Auxiliary Learning (IAL), a two-stage framework that uses uncertainty-based weights to train decoders impartially and gradient-normalization coupled with a task-uncertainty–driven function to balance encoder training in favor of the primary task. Through extensive experiments on NYUv2, Cityscapes, Pascal Context, and a multi-domain CIFAR-100 setting, IAL achieves state-of-the-art or competitive results and demonstrates robustness to noisy pseudo-tasks. The method also includes thorough ablations and analyses of balancing strategies, and highlights its potential applicability to large pre-trained model–driven auxiliary tasks. Overall, IAL advances practical multi-task learning by ensuring high-quality training of auxiliary tasks to bolster the primary task while maintaining stability and efficiency.
Abstract
Human beings can leverage knowledge from relative tasks to improve learning on a primary task. Similarly, multi-task learning methods suggest using auxiliary tasks to enhance a neural network's performance on a specific primary task. However, previous methods often select auxiliary tasks carefully but treat them as secondary during training. The weights assigned to auxiliary losses are typically smaller than the primary loss weight, leading to insufficient training on auxiliary tasks and ultimately failing to support the main task effectively. To address this issue, we propose an uncertainty-based impartial learning method that ensures balanced training across all tasks. Additionally, we consider both gradients and uncertainty information during backpropagation to further improve performance on the primary task. Extensive experiments show that our method achieves performance comparable to or better than state-of-the-art approaches. Moreover, our weighting strategy is effective and robust in enhancing the performance of the primary task regardless the noise auxiliary tasks' pseudo labels.
