CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding
Libo Qin, Fuxuan Wei, Qiguang Chen, Jingxuan Zhou, Shijue Huang, Jiasheng Si, Wenpeng Lu, Wanxiang Che
TL;DR
CroPrompt tackles zero-shot spoken language understanding by explicitly enabling cross-task interaction between intent detection and slot filling through a two-stage prompting pipeline. It further mitigates error propagation with a multi-task self-consistency mechanism that ensembles reasoning across routes for both sentence-level intents and token-level slot spans. Empirical results on SNIPS across multiple LLMs show CroPrompt consistently surpasses vanilla prompting and benefits substantially from self-consistency, with GPT-4 achieving strong sentence accuracy and slot F1. The approach also demonstrates token-cost reductions and better generalization to downstream dialogue tasks under SFT, suggesting cross-task prompting as a viable path for data-efficient SLU. Overall, CroPrompt establishes a principled framework for cross-task information exchange in prompt-based SLU with robust performance improvements.
Abstract
Slot filling and intent detection are two highly correlated tasks in spoken language understanding (SLU). Recent SLU research attempts to explore zero-shot prompting techniques in large language models to alleviate the data scarcity problem. Nevertheless, the existing prompting work ignores the cross-task interaction information for SLU, which leads to sub-optimal performance. To solve this problem, we present the pioneering work of Cross-task Interactive Prompting (CroPrompt) for SLU, which enables the model to interactively leverage the information exchange across the correlated tasks in SLU. Additionally, we further introduce a multi-task self-consistency mechanism to mitigate the error propagation caused by the intent information injection. We conduct extensive experiments on the standard SLU benchmark and the results reveal that CroPrompt consistently outperforms the existing prompting approaches. In addition, the multi-task self-consistency mechanism can effectively ease the error propagation issue, thereby enhancing the performance. We hope this work can inspire more research on cross-task prompting for SLU.
