Table of Contents
Fetching ...

Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Shuqi Liu, Han Wu, Guanzhi Deng, Jianshu Chen, Xiaoyang Wang, Linqi Song

TL;DR

<3-5 sentence high-level summary>The paper tackles the challenge of interpretability in knowledge-enhanced language generation by introducing a task-agnostic Structured Knowledge Hunter that exploits a two-tier knowledge hierarchy (high-level entities and low-level knowledge triples). It combines a local-global entity-centric encoder with a hierarchical pointer planner to extract coherent knowledge plans that guide language model generation through a template-based conversion, enabling human-readable justification of outputs. Across internal table-to-text (RotoWire-FG) and external knowledge-grounded dialogue (KdConv), the method improves knowledge selection accuracy and text quality, outperforming task-specific baselines and standard LMs while maintaining interpretability. Ablation studies and human evaluation support the benefits of entity-aware encoding, single local-global fusion layer usage, and the multi-task learning losses in preserving hierarchical consistency and factual grounding.

Abstract

Knowledge-enhanced text generation aims to enhance the quality of generated text by utilizing internal or external knowledge sources. While language models have demonstrated impressive capabilities in generating coherent and fluent text, the lack of interpretability presents a substantial obstacle. The limited interpretability of generated text significantly impacts its practical usability, particularly in knowledge-enhanced text generation tasks that necessitate reliability and explainability. Existing methods often employ domain-specific knowledge retrievers that are tailored to specific data characteristics, limiting their generalizability to diverse data types and tasks. To overcome this limitation, we directly leverage the two-tier architecture of structured knowledge, consisting of high-level entities and low-level knowledge triples, to design our task-agnostic structured knowledge hunter. Specifically, we employ a local-global interaction scheme for structured knowledge representation learning and a hierarchical transformer-based pointer network as the backbone for selecting relevant knowledge triples and entities. By combining the strong generative ability of language models with the high faithfulness of the knowledge hunter, our model achieves high interpretability, enabling users to comprehend the model output generation process. Furthermore, we empirically demonstrate the effectiveness of our model in both internal knowledge-enhanced table-to-text generation on the RotoWireFG dataset and external knowledge-enhanced dialogue response generation on the KdConv dataset. Our task-agnostic model outperforms state-of-the-art methods and corresponding language models, setting new standards on the benchmark.

Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

TL;DR

<3-5 sentence high-level summary>The paper tackles the challenge of interpretability in knowledge-enhanced language generation by introducing a task-agnostic Structured Knowledge Hunter that exploits a two-tier knowledge hierarchy (high-level entities and low-level knowledge triples). It combines a local-global entity-centric encoder with a hierarchical pointer planner to extract coherent knowledge plans that guide language model generation through a template-based conversion, enabling human-readable justification of outputs. Across internal table-to-text (RotoWire-FG) and external knowledge-grounded dialogue (KdConv), the method improves knowledge selection accuracy and text quality, outperforming task-specific baselines and standard LMs while maintaining interpretability. Ablation studies and human evaluation support the benefits of entity-aware encoding, single local-global fusion layer usage, and the multi-task learning losses in preserving hierarchical consistency and factual grounding.

Abstract

Knowledge-enhanced text generation aims to enhance the quality of generated text by utilizing internal or external knowledge sources. While language models have demonstrated impressive capabilities in generating coherent and fluent text, the lack of interpretability presents a substantial obstacle. The limited interpretability of generated text significantly impacts its practical usability, particularly in knowledge-enhanced text generation tasks that necessitate reliability and explainability. Existing methods often employ domain-specific knowledge retrievers that are tailored to specific data characteristics, limiting their generalizability to diverse data types and tasks. To overcome this limitation, we directly leverage the two-tier architecture of structured knowledge, consisting of high-level entities and low-level knowledge triples, to design our task-agnostic structured knowledge hunter. Specifically, we employ a local-global interaction scheme for structured knowledge representation learning and a hierarchical transformer-based pointer network as the backbone for selecting relevant knowledge triples and entities. By combining the strong generative ability of language models with the high faithfulness of the knowledge hunter, our model achieves high interpretability, enabling users to comprehend the model output generation process. Furthermore, we empirically demonstrate the effectiveness of our model in both internal knowledge-enhanced table-to-text generation on the RotoWireFG dataset and external knowledge-enhanced dialogue response generation on the KdConv dataset. Our task-agnostic model outperforms state-of-the-art methods and corresponding language models, setting new standards on the benchmark.

Paper Structure

This paper contains 33 sections, 15 equations, 3 figures, 8 tables.

Figures (3)

  • Figure 1: Generation examples of Structured Knowledge Hunter for numerical table summarization (top) and knowledge graph augmented dialogue (down).
  • Figure 2: Overall pipeline paradigm of steering language model-based text generator with a Structured Knowledge Hunter. The Knowledge Hunter is designed as a hierarchical encoder-decoder system that searches for relevant knowledge triples from input knowledge graphs or numerical tables. These knowledge triples are then processed through a Template Conversion stage that transforms them into a format that can be readily fed into the language model. Finally, the language model generates text based on the processed knowledge triples.
  • Figure 3: A hypothetical example from the KdConv dataset and responses generated by our model HuntAug-BART for a travel conversation.