A Dynamic Fusion Model for Consistent Crisis Response
Xiaoying Song, Anirban Saha Anik, Eduardo Blanco, Vanessa Frias-Martinez, Lingzi Hong
TL;DR
This paper addresses the problem of stylistic inconsistency in crisis-response generation by defining a formal consistency metric across professionalism, actionability, and relevance, and introducing a fusion-based generation framework that combines Instructional Prompt and RAG with evaluation-guided synthesis. The fusion mechanism is formalized as $CC(N, D) = \mathcal{L}(\text{Fuse}( M_{\text{IP}}(N), M_{\text{RAG}}(N), \mathbf{s}_{\text{IP}}, \mathbf{s}_{\text{RAG}} ))$, enabling instance-level optimization across critical communicative dimensions. Empirical evaluation on a large Twitter crisis dataset shows fusion-based methods improve overall quality and drastically reduce variation compared with baselines, with cross-crisis generalization demonstrating robustness to earthquakes and typhoons. Human evaluations further corroborate the superiority of fused responses in terms of consistency and usefulness, highlighting practical impact for scalable, trustworthy crisis communication. The work advances crisis informatics by providing a scalable, model-agnostic approach to producing uniformly high-quality guidance during disasters.
Abstract
In response to the urgent need for effective communication with crisis-affected populations, automated responses driven by language models have been proposed to assist in crisis communications. A critical yet often overlooked factor is the consistency of response style, which could affect the trust of affected individuals in responders. Despite its importance, few studies have explored methods for maintaining stylistic consistency across generated responses. To address this gap, we propose a novel metric for evaluating style consistency and introduce a fusion-based generation approach grounded in this metric. Our method employs a two-stage process: it first assesses the style of candidate responses and then optimizes and integrates them at the instance level through a fusion process. This enables the generation of high-quality responses while significantly reducing stylistic variation between instances. Experimental results across multiple datasets demonstrate that our approach consistently outperforms baselines in both response quality and stylistic uniformity.
