Adapting Knowledge for Few-shot Table-to-Text Generation
Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Guanjie Zheng, Xinbing Wang
TL;DR
The paper tackles the challenge of few-shot table-to-text generation by introducing AKG, a modular framework that leverages unlabeled domain-specific knowledge through prompt templates and a knowledge-adaptation mechanism. It decomposes the task into Generation, Knowledge-Augmentation, and Fine-Tuning modules, incorporating a Prototype-Selection Task, a knowledge-augmented language modeling objective, and a Knowledge Adapter to fuse domain knowledge with tabular cues. Using three multidomain datasets from WIKIBIO (Humans, Books, Songs), AKG demonstrates superior fluency and faithfulness versus strong baselines, aided by an information-retrieval prototype memory and unlabeled corpus prompts. The approach shows robust gains in both automatic metrics (BLEU-4, ROUGE-4, PARENT-F) and human judgments, highlighting the practical impact of modular, knowledge-aware pretraining for real-world table-to-text generation under data scarcity.
Abstract
Pretrained language models (PLMs) have made remarkable progress in table-to-text generation tasks. However, the lack of domain-specific knowledge makes it challenging to bridge the topological gap between tabular data and text, especially in real-world applications with limited resources. To mitigate the limitation of insufficient labeled data, we propose a novel framework: Adapt-Knowledge-to-Generate (AKG). The core insight of AKG is to adapt unlabeled domain-specific knowledge into the model, which brings at least three benefits: (1) it injects representation of normal table-related descriptions to bridge the topological gap between tabular data and texts; (2) it enables us to use large amounts of unlabeled domain-specific knowledge fully, which can alleviate the PLMs' inherent shortcomings of lacking domain knowledge; (3) it allows us to design various tasks to employ the domain-specific knowledge. Extensive experiments and analyses are conducted on three open-domain, few-shot natural language generation (NLG) data sets: Humans, Songs, and Books. Compared to previous state-of-the-art approaches, our model achieves superior performance in terms of both fluency and accuracy as judged by human and automatic evaluations.
