HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning

Donggyun Kim; Chanwoo Kim; Seunghoon Hong

HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning

Donggyun Kim, Chanwoo Kim, Seunghoon Hong

TL;DR

HyperFlow tackles the high compute and memory cost of test-time fine-tuning in few-shot classification by learning a gradient-free surrogate for gradient descent. It trains a conditional drift network to predict the parameter drift conditioned on a small support set, and performs adaptation via a short ODE integration, updating only a compact subset of target parameters through bias-tuning. The method yields substantial improvements in out-of-domain performance on Meta-Dataset and CDFSL benchmarks while incurring only a small fraction of the memory and time costs of standard fine-tuning. This establishes a practical middle ground between direct transfer and full fine-tuning, enabling real-time or resource-constrained adaptation for cross-domain FSC tasks.

Abstract

While test-time fine-tuning is beneficial in few-shot learning, the need for multiple backpropagation steps can be prohibitively expensive in real-time or low-resource scenarios. To address this limitation, we propose an approach that emulates gradient descent without computing gradients, enabling efficient test-time adaptation. Specifically, we formulate gradient descent as an Euler discretization of an ordinary differential equation (ODE) and train an auxiliary network to predict the task-conditional drift using only the few-shot support set. The adaptation then reduces to a simple numerical integration (e.g., via the Euler method), which requires only a few forward passes of the auxiliary network -- no gradients or forward passes of the target model are needed. In experiments on cross-domain few-shot classification using the Meta-Dataset and CDFSL benchmarks, our method significantly improves out-of-domain performance over the non-fine-tuned baseline while incurring only 6\% of the memory cost and 0.02\% of the computation time of standard fine-tuning, thus establishing a practical middle ground between direct transfer and fully fine-tuned approaches.

HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning

TL;DR

Abstract

HyperFlow: Gradient-Free Emulation of Few-Shot Fine-Tuning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)