Label Distribution Learning-Enhanced Dual-KNN for Text Classification

Bo Yuan; Yulin Chen; Zhen Tan; Wang Jinyan; Huan Liu; Yin Zhang

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

Bo Yuan, Yulin Chen, Zhen Tan, Wang Jinyan, Huan Liu, Yin Zhang

TL;DR

This work tackles text classification by exploiting internal model information through a dual $k$NN (D$k$NN) framework that retrieves neighbors using both text embeddings and predicted label distributions. A label distribution learning (LL) module learns label similarity and uses contrastive learning to produce more discriminative label representations, improving both the base model and the quality of retrieved neighbors. Empirical results across five datasets show consistent accuracy gains and enhanced robustness to noisy labels, outperforming various baselines and ablations. The approach advances retrieval-augmented NLP by leveraging intermediate representations and label correlations to enable more robust and reliable predictions.

Abstract

Many text classification methods usually introduce external information (e.g., label descriptions and knowledge bases) to improve the classification performance. Compared to external information, some internal information generated by the model itself during training, like text embeddings and predicted label probability distributions, are exploited poorly when predicting the outcomes of some texts. In this paper, we focus on leveraging this internal information, proposing a dual $k$ nearest neighbor (D$k$NN) framework with two $k$NN modules, to retrieve several neighbors from the training set and augment the distribution of labels. For the $k$NN module, it is easily confused and may cause incorrect predictions when retrieving some nearest neighbors from noisy datasets (datasets with labeling errors) or similar datasets (datasets with similar labels). To address this issue, we also introduce a label distribution learning module that can learn label similarity, and generate a better label distribution to help models distinguish texts more effectively. This module eases model overfitting and improves final classification performance, hence enhancing the quality of the retrieved neighbors by $k$NN modules during inference. Extensive experiments on the benchmark datasets verify the effectiveness of our method.

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

TL;DR

This work tackles text classification by exploiting internal model information through a dual

NN (D

NN) framework that retrieves neighbors using both text embeddings and predicted label distributions. A label distribution learning (LL) module learns label similarity and uses contrastive learning to produce more discriminative label representations, improving both the base model and the quality of retrieved neighbors. Empirical results across five datasets show consistent accuracy gains and enhanced robustness to noisy labels, outperforming various baselines and ablations. The approach advances retrieval-augmented NLP by leveraging intermediate representations and label correlations to enable more robust and reliable predictions.

Abstract

nearest neighbor (D

NN) framework with two

NN modules, to retrieve several neighbors from the training set and augment the distribution of labels. For the

NN module, it is easily confused and may cause incorrect predictions when retrieving some nearest neighbors from noisy datasets (datasets with labeling errors) or similar datasets (datasets with similar labels). To address this issue, we also introduce a label distribution learning module that can learn label similarity, and generate a better label distribution to help models distinguish texts more effectively. This module eases model overfitting and improves final classification performance, hence enhancing the quality of the retrieved neighbors by

NN modules during inference. Extensive experiments on the benchmark datasets verify the effectiveness of our method.

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

TL;DR

Abstract

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)