Discourse-Aware In-Context Learning for Temporal Expression Normalization
Akash Kumar Gautam, Lukas Lange, Jannik Strötgen
TL;DR
The paper tackles temporal expression normalization under data scarcity and domain shift by adopting discourse-aware in-context learning with retrieved examples and a document-level context window. It designs a prompt framework and sample-selection strategies for LLMs to perform TE normalization without fine-tuning, evaluated across six domains and seven languages with GPT-3.5-turbo and Zephyr. Results show competitive performance to dedicated normalization models, with notable gains when target documents are distant from training data, especially using Target-centric + Context Window prompts. The work demonstrates the practical potential of zero-/few-shot TE normalization in multilingual, cross-domain IE pipelines, while identifying limitations in long-range discourse and language coverage.
Abstract
Temporal expression (TE) normalization is a well-studied problem. However, the predominately used rule-based systems are highly restricted to specific settings, and upcoming machine learning approaches suffer from a lack of labeled data. In this work, we explore the feasibility of proprietary and open-source large language models (LLMs) for TE normalization using in-context learning to inject task, document, and example information into the model. We explore various sample selection strategies to retrieve the most relevant set of examples. By using a window-based prompt design approach, we can perform TE normalization across sentences, while leveraging the LLM knowledge without training the model. Our experiments show competitive results to models designed for this task. In particular, our method achieves large performance improvements for non-standard settings by dynamically including relevant examples during inference.
