ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
Tz-Ying Wu, Chih-Hui Ho, Nuno Vasconcelos
TL;DR
ProTeCt addresses the challenge of taxonomic open set (TOS) classification, where predictions must be consistent across multiple hierarchical levels. The authors introduce two metrics, hierarchical consistent accuracy (HCA) and mean treecut accuracy (MTA), to evaluate hierarchical reliability beyond leaf-level accuracy. They propose ProTeCt, a plug-in prompt-tuning framework that jointly optimizes a node-centric loss and a dynamic treecut loss to enforce consistency across a taxonomy during training, while preserving leaf accuracy. Experiments show substantial gains in HCA and MTA across CIFAR-100, SUN, and ImageNet, with successful domain generalization to unseen image domains and compatibility with various CLIP architectures, prompt methods, and adapters. The work provides a practical path to reliable, multi-granularity classification in real-world applications that require predictions at different taxonomic levels.
Abstract
Visual-language foundation models, like CLIP, learn generalized representations that enable zero-shot open-set classification. Few-shot adaptation methods, based on prompt tuning, have been shown to further improve performance on downstream datasets. However, these methods do not fare well in the taxonomic open set (TOS) setting, where the classifier is asked to make predictions from label sets across different levels of semantic granularity. Frequently, they infer incorrect labels at coarser taxonomic class levels, even when the inference at the leaf level (original class labels) is correct. To address this problem, we propose a prompt tuning technique that calibrates the hierarchical consistency of model predictions. A set of metrics of hierarchical consistency, the Hierarchical Consistent Accuracy (HCA) and the Mean Treecut Accuracy (MTA), are first proposed to evaluate TOS model performance. A new Prompt Tuning for Hierarchical Consistency (ProTeCt) technique is then proposed to calibrate classification across label set granularities. Results show that ProTeCt can be combined with existing prompt tuning methods to significantly improve TOS classification without degrading the leaf level classification performance.
