Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification
Haochun Wang, Sendong Zhao, Chi Liu, Nuwa Xi, Muzhen Cai, Bing Qin, Ting Liu
TL;DR
This work tackles the tuning burden in prompt-based classification by recognizing that verbalizer embeddings often lie on non-linear manifolds, where Euclidean distance is misleading. It introduces Locally Linear Embedding with Intra-class Neighborhood Constraint (LLE-INC), a tuning-free method that re-embeds the verbalizer space to preserve intra-class locality, followed by kNN classification in the re-embedded space. A contrasting updating module using InfoNCE further enhances PTM representations when tuning is allowed. Across 10 benchmarks and large language models like LLaMA, LLE-INC demonstrates competitive or superior performance without tuning, and gains when combined with parameter updates, underscoring its practical potential for resource-efficient classification on hyper-scale models.
Abstract
Prompt-based classification adapts tasks to a cloze question format utilizing the [MASK] token and the filled tokens are then mapped to labels through pre-defined verbalizers. Recent studies have explored the use of verbalizer embeddings to reduce labor in this process. However, all existing studies require a tuning process for either the pre-trained models or additional trainable embeddings. Meanwhile, the distance between high-dimensional verbalizer embeddings should not be measured by Euclidean distance due to the potential for non-linear manifolds in the representation space. In this study, we propose a tuning-free manifold-based space re-embedding method called Locally Linear Embedding with Intra-class Neighborhood Constraint (LLE-INC) for verbalizer embeddings, which preserves local properties within the same class as guidance for classification. Experimental results indicate that even without tuning any parameters, our LLE-INC is on par with automated verbalizers with parameter tuning. And with the parameter updating, our approach further enhances prompt-based tuning by up to 3.2%. Furthermore, experiments with the LLaMA-7B&13B indicate that LLE-INC is an efficient tuning-free classification approach for the hyper-scale language models.
