Generative Meta-Learning for Zero-Shot Relation Triplet Extraction
Wanli Li, Tieyun Qian, Yi Song, Zeyu Zhang, Jiawei Li, Zhuang Chen, Lixin Zou
TL;DR
Zero-shot Relation Triplet Extraction (ZeroRTE) requires identifying unseen relation types from text. The authors propose a Generative Meta-Learning framework built on bi-level optimization (BLO) that separates data fitting (LL) from generalization (UL) and introduces a task-aware generative model (TGM). They instantiate three BLO-driven meta-learning variants—TGM-Metric, TGM-Model, and TGM-Optimization—to capture cross-task knowledge directly from training data, without relying on synthesized data alone. Empirical results on FewRel and Wiki-ZSL demonstrate superior generalization over strong baselines and competitive performance against large language models, with detailed analyses of prompt settings and hyper-parameters. The work provides a BLO-based pathway to robust ZeroRTE, enabling more reliable extraction of unseen relations in practical IR and NLP tasks.
Abstract
Zero-shot Relation Triplet Extraction (ZeroRTE) aims to extract relation triplets from texts containing unseen relation types. This capability benefits various downstream information retrieval (IR) tasks. The primary challenge lies in enabling models to generalize effectively to unseen relation categories. Existing approaches typically leverage the knowledge embedded in pre-trained language models to accomplish the generalization process. However, these methods focus solely on fitting the training data during training, without specifically improving the model's generalization performance, resulting in limited generalization capability. For this reason, we explore the integration of bi-level optimization (BLO) with pre-trained language models for learning generalized knowledge directly from the training data, and propose a generative meta-learning framework which exploits the `learning-to-learn' ability of meta-learning to boost the generalization capability of generative models. Specifically, we introduce a BLO approach that simultaneously addresses data fitting and generalization. This is achieved by constructing an upper-level loss to focus on generalization and a lower-level loss to ensure accurate data fitting. Building on this, we subsequently develop three generative meta-learning methods, each tailored to a distinct category of meta-learning. Extensive experimental results demonstrate that our framework performs well on the ZeroRTE task. Our code is available at https://github.com/leeworry/TGM-MetaLearning.
