Table of Contents
Fetching ...

SLURP: A Spoken Language Understanding Resource Package

Emanuele Bastianelli, Andrea Vanzo, Pawel Swietojanski, Verena Rieser

TL;DR

SLURP delivers a large, multi-domain SLU dataset with 72k audio examples and three-tier semantic annotations (Scenario, Action, Entities), coupled with a novel SLU-F1 metric to better capture ASR-NLU error propagation. The authors show SLURP is more linguistically and semantically diverse than prior resources and that modular (pipeline) SLU baselines currently outperform end-to-end approaches on this data. They also demonstrate that a top-down, scenario-first decoding strategy is more robust to transcription noise than bottom-up entity-first methods. The work provides extensive linguistic analyses, introduces synthetic data augmentation, and emphasizes error analysis as a practical tool for improving SLU systems, while outlining future steps toward spontaneous speech data.

Abstract

Spoken Language Understanding infers semantic meaning directly from audio data, and thus promises to reduce error propagation and misunderstandings in end-user applications. However, publicly available SLU resources are limited. In this paper, we release SLURP, a new SLU package containing the following: (1) A new challenging dataset in English spanning 18 domains, which is substantially bigger and linguistically more diverse than existing datasets; (2) Competitive baselines based on state-of-the-art NLU and ASR systems; (3) A new transparent metric for entity labelling which enables a detailed error analysis for identifying potential areas of improvement. SLURP is available at https: //github.com/pswietojanski/slurp.

SLURP: A Spoken Language Understanding Resource Package

TL;DR

SLURP delivers a large, multi-domain SLU dataset with 72k audio examples and three-tier semantic annotations (Scenario, Action, Entities), coupled with a novel SLU-F1 metric to better capture ASR-NLU error propagation. The authors show SLURP is more linguistically and semantically diverse than prior resources and that modular (pipeline) SLU baselines currently outperform end-to-end approaches on this data. They also demonstrate that a top-down, scenario-first decoding strategy is more robust to transcription noise than bottom-up entity-first methods. The work provides extensive linguistic analyses, introduces synthetic data augmentation, and emphasizes error analysis as a practical tool for improving SLU systems, while outlining future steps toward spontaneous speech data.

Abstract

Spoken Language Understanding infers semantic meaning directly from audio data, and thus promises to reduce error propagation and misunderstandings in end-user applications. However, publicly available SLU resources are limited. In this paper, we release SLURP, a new SLU package containing the following: (1) A new challenging dataset in English spanning 18 domains, which is substantially bigger and linguistically more diverse than existing datasets; (2) Competitive baselines based on state-of-the-art NLU and ASR systems; (3) A new transparent metric for entity labelling which enables a detailed error analysis for identifying potential areas of improvement. SLURP is available at https: //github.com/pswietojanski/slurp.

Paper Structure

This paper contains 15 sections, 6 figures, 8 tables, 1 algorithm.

Figures (6)

  • Figure 1: Example annotation from SLURP dataset.
  • Figure 2: Amounts of data in SLURP matching given WER levels.
  • Figure 3: Syntactic complexity on D-Level scale, where higher levels correspond to more complex, deeper syntactic structures.
  • Figure 4: Continued example from Figure \ref{['fig:data']}: Errors in SLU entity tagging.
  • Figure 5: Error propagation: No Errors refer to the number of predicted entities that match the gold transcriptions perfectly. ASR Errors count the number of predictions where ASR outputs an unmatched candidate but the NLU system is nevertheless able to recover the correct entities from the transcriptions. NLU Errors count sentences where transcriptions are correct, but entities do not match. ASR/NLU Errors count the sentences where both ASR and NLU errors are present.
  • ...and 1 more figures