Scenarios and Approaches for Situated Natural Language Explanations

Pengshuo Qiu; Frank Rudzicz; Zining Zhu

Scenarios and Approaches for Situated Natural Language Explanations

Pengshuo Qiu, Frank Rudzicz, Zining Zhu

TL;DR

The paper addresses how to tailor natural language explanations to specific audiences and introduces the Situation-Based Explanation (SBE) benchmark consisting of 100 explananda with audience-targeted explanations. It systematically evaluates rule-based prompting, meta-prompting, and in-context learning across multiple LLMs, using similarity and cross-entropy-based matching metrics to quantify adaptation. Key findings show that specifying both audience and desired features yields the strongest alignment with human explanations, that persona prompts offer limited benefits, and that in-context learning can boost similarity but may not fully capture situational context; GPT-4 and Gemini-Pro can underperform compared with GPT-3.5 due to overly long outputs. The work provides a quantitative foundation and practical guidelines for developing situated NLE tools and paves the way for broader datasets and evaluation in audience-aware explanations.

Abstract

Large language models (LLMs) can be used to generate natural language explanations (NLE) that are adapted to different users' situations. However, there is yet to be a quantitative evaluation of the extent of such adaptation. To bridge this gap, we collect a benchmarking dataset, Situation-Based Explanation. This dataset contains 100 explanandums. Each explanandum is paired with explanations targeted at three distinct audience types-such as educators, students, and professionals-enabling us to assess how well the explanations meet the specific informational needs and contexts of these diverse groups e.g. students, teachers, and parents. For each "explanandum paired with an audience" situation, we include a human-written explanation. These allow us to compute scores that quantify how the LLMs adapt the explanations to the situations. On an array of pretrained language models with varying sizes, we examine three categories of prompting methods: rule-based prompting, meta-prompting, and in-context learning prompting. We find that 1) language models can generate prompts that result in explanations more precisely aligned with the target situations, 2) explicitly modeling an "assistant" persona by prompting "You are a helpful assistant..." is not a necessary prompt technique for situated NLE tasks, and 3) the in-context learning prompts only can help LLMs learn the demonstration template but can't improve their inference performance. SBE and our analysis facilitate future research towards generating situated natural language explanations.

Scenarios and Approaches for Situated Natural Language Explanations

TL;DR

Abstract

Paper Structure (36 sections, 2 equations, 6 figures, 4 tables)

This paper contains 36 sections, 2 equations, 6 figures, 4 tables.

Introduction
Related Work
Natural language explanation
Human-centered explanation
Cultural and societal knowledge
Data
Methods
Rule-based prompting methods
Base prompt
Specify the audience or the desired feature
Adopt a persona
Elicit the NLE with complete sentences
Meta prompt
In-context Learning Prompt
Experiment setup
...and 21 more sections

Figures (6)

Figure 1: Different audiences need different explanations.
Figure 2: The distribution of categories in SBE.
Figure 3: Average similarity and matching scores for all prompt techniques. 'M-GPT' refers to the use of GPT-3.5-turbo to generate prompts for situated NLE. 'Meta' refers to using the response model itself to generate prompts and respond to those. Note: A decrease in the matching score correlates with an enhancement in model performance on situated NLE tasks.
Figure 4: Average similarity and matching scores for all LLMs: 'P-2.8' represents Pythia-2.8B, 'L-7' stands for LLaMa-7B, 'L-13' is LLaMa-13B, and 'Y-34' indicates Yi-34B. Note: A decrease in the matching score correlates with an enhancement in model performance on situated NLE tasks.
Figure 5: Similarity score heatmap.
...and 1 more figures

Scenarios and Approaches for Situated Natural Language Explanations

TL;DR

Abstract

Scenarios and Approaches for Situated Natural Language Explanations

Authors

TL;DR

Abstract

Table of Contents

Figures (6)