Llamipa: An Incremental Discourse Parser
Kate Thompson, Akshay Chaturvedi, Julie Hunter, Nicholas Asher
TL;DR
This work addresses incremental SDRT-style discourse parsing by fine-tuning Llama-based LLMs to jointly predict attachment links and relation types using already inferred discourse structure. The Llamipa model leverages a large context window ($k=15$) and QLoRA-fine-tuning to produce incremental predictions, achieving state-of-the-art results on MSDC, STAC-L, and out-of-domain Molweni in relation labeling. Through comprehensive ablations, the authors demonstrate that explicit discourse structure in the input, as well as a broad contextual window, are crucial for robust long-distance relation predictions, especially for Narration and Correction. The approach holds promise for downstream conversational systems and tasks requiring structured discourse representations, while acknowledging segmentation requirements and cross-domain generalizability limitations.
Abstract
This paper provides the first discourse parsing experiments with a large language model(LLM) finetuned on corpora annotated in the style of SDRT (Segmented Discourse Representation Theory Asher, 1993; Asher and Lascarides, 2003). The result is a discourse parser, Llamipa (Llama Incremental Parser), that leverages discourse context, leading to substantial performance gains over approaches that use encoder-only models to provide local, context-sensitive representations of discourse units. Furthermore, it can process discourse data incrementally, which is essential for the eventual use of discourse information in downstream tasks.
