NexusFlow: Unifying Disparate Tasks under Partial Supervision via Invertible Flow Networks
Fangzhou Lin, Yuping Wang, Yuliang Guo, Zixun Huang, Xinyu Huang, Haichong Zhang, Kazunori Yamada, Zhengzhong Tu, Liu Ren, Ziming Zhang
TL;DR
This work tackles Partially Supervised Multi-Task Learning when tasks are structurally different and supervision is domain-partitioned. It introduces NexusFlow, a plug-and-play framework that uses per-task surrogate modules with invertible coupling layers to map task features into a shared latent space for distribution alignment, preserving information and avoiding collapse. The approach is theoretically justified via a Lipschitz-based bound and empirically validated on nuScenes (domain-partitioned autonomous driving) and NYU-V2 (multi-task indoor perception), showing improved cross-task transfer and state-of-the-art performance under PS-MTL. The results demonstrate broad applicability, improved alignment of heterogeneous tasks, and efficiency advantages over prior PS-MTL methods designed for homogeneous tasks.
Abstract
Partially Supervised Multi-Task Learning (PS-MTL) aims to leverage knowledge across tasks when annotations are incomplete. Existing approaches, however, have largely focused on the simpler setting of homogeneous, dense prediction tasks, leaving the more realistic challenge of learning from structurally diverse tasks unexplored. To this end, we introduce NexusFlow, a novel, lightweight, and plug-and-play framework effective in both settings. NexusFlow introduces a set of surrogate networks with invertible coupling layers to align the latent feature distributions of tasks, creating a unified representation that enables effective knowledge transfer. The coupling layers are bijective, preserving information while mapping features into a shared canonical space. This invertibility avoids representational collapse and enables alignment across structurally different tasks without reducing expressive capacity. We first evaluate NexusFlow on the core challenge of domain-partitioned autonomous driving, where dense map reconstruction and sparse multi-object tracking are supervised in different geographic regions, creating both structural disparity and a strong domain gap. NexusFlow sets a new state-of-the-art result on nuScenes, outperforming strong partially supervised baselines. To demonstrate generality, we further test NexusFlow on NYUv2 using three homogeneous dense prediction tasks, segmentation, depth, and surface normals, as a representative N-task PS-MTL scenario. NexusFlow yields consistent gains across all tasks, confirming its broad applicability.
