DynaPrompt: Dynamic Test-Time Prompt Tuning

Zehao Xiao; Shilin Yan; Jack Hong; Jiayin Cai; Xiaolong Jiang; Yao Hu; Jiayi Shen; Qi Wang; Cees G. M. Snoek

TL;DR

DynaPrompt addresses distribution shift in vision-language models with a dynamic test-time prompt-tuning framework. It maintains an online prompt buffer and, for each test sample, selects which prompts to update using two metrics, prediction entropy and probability difference. This limits the error accumulation that leads to prompt collapse while still exploiting relevant information from earlier samples. For test data unlike what the buffer has seen, new prompts are appended, and inactive prompts are deleted so the buffer size stays controlled. The method improves results on domain generalization and cross-dataset benchmarks at manageable computational cost, and it can be combined with existing prompt-tuning methods.

Abstract

Test-time prompt tuning enhances zero-shot generalization of vision-language models but tends to ignore the relatedness among test samples during inference. Online test-time prompt tuning provides a simple way to leverage the information in previous test samples, albeit with the risk of prompt collapse due to error accumulation. To enhance test-time prompt tuning, we propose DynaPrompt, short for dynamic test-time prompt tuning, exploiting relevant data distribution information while reducing error accumulation. Built on an online prompt buffer, DynaPrompt adaptively selects and optimizes the relevant prompts for each test sample during tuning. Specifically, we introduce a dynamic prompt selection strategy based on two metrics: prediction entropy and probability difference. For unseen test data information, we develop dynamic prompt appending, which allows the buffer to append new prompts and delete the inactive ones. By doing so, the prompts are optimized to exploit beneficial information on specific test data, while alleviating error accumulation. Experiments on fourteen datasets demonstrate the effectiveness of dynamic test-time prompt tuning.
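
The selection and append/delete logic described above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the thresholding rule (a prompt is kept when its entropy is below, and its probability difference above, a reference value), and the least-recently-used eviction standing in for "inactive" prompts are all assumptions made for illustration.

```python
import math


def entropy(probs):
    """Shannon entropy (natural log) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_prompts(metrics, ref_entropy, ref_diff):
    """Return ids of buffered prompts that pass both tests.

    metrics maps prompt id -> (prediction entropy, probability difference).
    Assumed rule: keep a prompt if its entropy is lower than the reference
    and its probability difference is larger than the reference.
    """
    return [pid for pid, (ent, diff) in metrics.items()
            if ent < ref_entropy and diff > ref_diff]


def update_buffer(last_used, selected, step, max_size):
    """Decide which prompts to optimize on the current sample.

    last_used maps prompt id -> step at which the prompt was last selected.
    If nothing was selected, a new prompt id is appended; when the buffer is
    full, the least recently used prompt is deleted first (assumed policy).
    """
    if selected:
        active = list(selected)
    else:
        new_id = max(last_used, default=-1) + 1
        if len(last_used) >= max_size:
            stale = min(last_used, key=last_used.get)
            del last_used[stale]
        active = [new_id]
    for pid in active:
        last_used[pid] = step
    return active
```

In use, `select_prompts` would run once per test sample on the metrics of every buffered prompt, and `update_buffer` would return the prompt ids to tune on that sample. The actual prompt optimization step is omitted here.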
