DACE For Railway Acronym Disambiguation
El Mokhtar Hribach, Oussama Mechhour, Mohammed Elmonstaser, Yassine El Boudouri, Othmane Kabal
TL;DR
The paper tackles Acronym Disambiguation in French railway documentation, a domain with high terminology ambiguity and limited annotated data. It introduces DACE, a modular framework that fuses Dynamic Prompting, Retrieval Augmented Generation, Contextual Selection, and Ensemble Aggregation to improve LLM-driven disambiguation while curbing hallucinations. On the TextMine'26 dataset, DACE achieves a top private F1 of 0.9069 and demonstrates stable performance across splits, underscoring its robustness to low-resource and domain-shift conditions. Ablation studies confirm that both adaptive prompting and knowledge grounding substantially contribute to performance, with ensemble reasoning further enhancing reliability. The framework’s general design suggests applicability beyond railways, including multilingual settings and graph-based knowledge integration future work.
Abstract
Acronym Disambiguation (AD) is a fundamental challenge in technical text processing, particularly in specialized sectors where high ambiguity complicates automated analysis. This paper addresses AD within the context of the TextMine'26 competition on French railway documentation. We present DACE (Dynamic Prompting, Retrieval Augmented Generation, Contextual Selection, and Ensemble Aggregation), a framework that enhances Large Language Models through adaptive in-context learning and external domain knowledge injection. By dynamically tailoring prompts to acronym ambiguity and aggregating ensemble predictions, DACE mitigates hallucination and effectively handles low-resource scenarios. Our approach secured the top rank in the competition with an F1 score of 0.9069.
