Salutary Labeling with Zero Human Annotation
Wenxiao Xiao, Hongfu Liu
TL;DR
Salutary labeling tackles labeling cost and label quality in active learning by automatically selecting unlabeled samples and assigning salutary labels that maximize the estimated influence on a validation loss $\mathcal{L}_v$, as computed by the influence function $\mathcal{I}(x_j,y_j)$. By evaluating $\mathcal{I}(x_j,c)$ for all possible labels $c \in \mathcal{C}$ and choosing $\hat{c}=\operatorname{arg\,max}_c \mathcal{I}(x_j,c)$, the method merges querying and labeling into one autonomous step and selects the top $b$ samples per round. Empirical results across nine datasets and LLM-fine-tuning scenarios show consistently superior performance to traditional active-learning baselines and even surpass ground-truth labeling in some settings, all without human annotation. The approach leverages a convex surrogate for non-convex models via embeddings and demonstrates practical potential for cost-effective, data-efficient learning with broad applicability.
Abstract
Active learning strategically selects informative unlabeled data points and queries their ground truth labels for model training. The prevailing assumption underlying this machine learning paradigm is that acquiring these ground truth labels will optimally enhance model performance. However, this assumption may not always hold true or maximize learning capacity, particularly considering the costly labor annotations required for ground truth labels. In contrast to traditional ground truth labeling, this paper proposes salutary labeling, which automatically assigns the most beneficial labels to the most informative samples without human annotation. Specifically, we utilize the influence function, a tool for estimating sample influence, to select newly added samples and assign their salutary labels by choosing the category that maximizes their positive influence. This process eliminates the need for human annotation. Extensive experiments conducted on nine benchmark datasets demonstrate the superior performance of our salutary labeling approach over traditional active learning strategies. Additionally, we provide several in-depth explorations and practical applications of large language model (LLM) fine-tuning.
