On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction

Jianwei Wang; Tianyin Wang; Ziqian Zeng

On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction

Jianwei Wang, Tianyin Wang, Ziqian Zeng

TL;DR

This work tackles the reliance of information extraction on gold-standard data by exploiting silver standard data generated by pre-trained models in zero-shot settings. The proposed Clean-LaVe framework leverages LaVeEntail as a backbone and introduces two novel components, Iteratively Weighted Negative Learning and Class-Aware Data Selector, to identify and utilize clean silver labels for finetuning a textual entailment model. Empirical results across zero-shot relation extraction, cross-lingual relation extraction, and event argument classification demonstrate substantial improvements over strong baselines, including in cross-lingual scenarios, with publicly available code. Overall, Clean-LaVe offers a practical approach to boost zero-shot IE performance by effectively filtering and exploiting noisy silver data produced by existing NLP models.

Abstract

The superior performance of supervised classification methods in the information extraction (IE) area heavily relies on a large amount of gold standard data. Recent zero-shot classification methods converted the task to other NLP tasks (e.g., textual entailment) and used off-the-shelf models of these NLP tasks to directly perform inference on the test data without using a large amount of IE annotation data. A potentially valuable by-product of these methods is the large-scale silver standard data, i.e., pseudo-labeled data by the off-the-shelf models of other NLP tasks. However, there is no further investigation into the use of these data. In this paper, we propose a new framework, Clean-LaVe, which aims to utilize silver standard data to enhance the zero-shot performance. Clean-LaVe includes four phases: (1) Obtaining silver data; (2) Identifying relatively clean data from silver data; (3) Finetuning the off-the-shelf model using clean data; (4) Inference on the test data. The experimental results show that Clean-LaVe can outperform the baseline by 5% and 6% on TACRED and Wiki80 dataset in the zero-shot relation classification task, and by 3%-7% on Smile (Korean and Polish) in the zero-shot cross-lingual relation classification task, and by 8% on ACE05-E+ in the zero-shot event argument classification task. The code is share in https://github.com/wjw136/Clean_LaVe.git.

On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction

TL;DR

Abstract

Paper Structure (25 sections, 3 equations, 3 figures, 4 tables, 2 algorithms)

This paper contains 25 sections, 3 equations, 3 figures, 4 tables, 2 algorithms.

Introduction
Related Work
Zero-shot Relation Extraction.
Zero-shot Cross-lingual Information Extraction.
Zero-shot Event Argument Classification.
Learning with Noisy Labels.
Method
LaVeEntail
Label Verbalization
Textual Entailment Model Inference
Clean Data Detection
Iteratively Weighted Negative Learning
Class-Aware Data Selector
Finetuning and Inference
Experiment
...and 10 more sections

Figures (3)

Figure 1: The diagram shows the procedure of Clean-LaVe in the zero-shot relation extraction task. First, we apply LaVeEntail on an unlabeled dataset, obtaining standard silver data. The clean data detection module uses confidence scores to distinguish the clean and noisy samples. The selected clean data are used to finetune the TE model. Finally, we use this TE model to infer relation types on the test set.
Figure 2: Analysis of IWNL and CADS on TACRED.
Figure 3: Results of different $\eta$ and $m$.

On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction

TL;DR

Abstract

On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction

Authors

TL;DR

Abstract

Table of Contents

Figures (3)