FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures
Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao
TL;DR
FADE tackles the challenge of a truly task-agnostic upsampling operator for dense prediction by fusing encoder and decoder features to generate content-aware upsampling kernels. Its semi-shift convolution unifies interpolation, channel compression, and kernel generation, while a decoder-dependent gate selectively passes high-resolution encoder details to refine edges. Across semantic segmentation, image matting, object/detection, instance segmentation, and depth estimation, FADE demonstrates consistent improvements over fixed and prior dynamic upsampling methods, with a lightweight variant (FADE-Lite) maintaining strong performance at reduced cost. The method shifts the design focus to high-quality upsampling as a general-purpose component rather than task-specific tailoring, potentially influencing future encoder–decoder architectures and the development of vision foundation models. Overall, FADE provides robust, generalizable upsampling that improves region coherence and boundary delineation with practical efficiency considerations.
Abstract
The goal of this work is to develop a task-agnostic feature upsampling operator for dense prediction where the operator is required to facilitate not only region-sensitive tasks like semantic segmentation but also detail-sensitive tasks such as image matting. Prior upsampling operators often can work well in either type of the tasks, but not both. We argue that task-agnostic upsampling should dynamically trade off between semantic preservation and detail delineation, instead of having a bias between the two properties. In this paper, we present FADE, a novel, plug-and-play, lightweight, and task-agnostic upsampling operator by fusing the assets of decoder and encoder features at three levels: i) considering both the encoder and decoder feature in upsampling kernel generation; ii) controlling the per-point contribution of the encoder/decoder feature in upsampling kernels with an efficient semi-shift convolutional operator; and iii) enabling the selective pass of encoder features with a decoder-dependent gating mechanism for compensating details. To improve the practicality of FADE, we additionally study parameter- and memory-efficient implementations of semi-shift convolution. We analyze the upsampling behavior of FADE on toy data and show through large-scale experiments that FADE is task-agnostic with consistent performance improvement on a number of dense prediction tasks with little extra cost. For the first time, we demonstrate robust feature upsampling on both region- and detail-sensitive tasks successfully. Code is made available at: https://github.com/poppinace/fade
