Table of Contents
Fetching ...

EventWeave: A Dynamic Framework for Capturing Core and Supporting Events in Dialogue Systems

Zhengyi Zhao, Shubo Zhang, Yiming Du, Bin Liang, Baojun Wang, Zhongyang Li, Binyang Li, Kam-Fai Wong

TL;DR

The paper tackles the challenge that dialogue models often treat turns in isolation, neglecting underlying event structures that guide natural interactions. It introduces EventWeave, a dynamic hierarchical event-graph framework that differentiates core goals from supporting details and uses a multi-head attention mechanism to selectively retrieve relevant events for each turn. The approach defines three edge types—sequential, temporal, and reasoning—to model nuanced event relationships, along with adaptive node preservation and a multi-perspective graph-retrieval strategy to guide response generation. Experiments on Conversation Chronicle, Multi-Session Chat, and LoCoMo show that EventWeave yields more natural and contextually appropriate responses while reducing computation through graph-based representations and pruning. Together, these results suggest a scalable route to more coherent, efficient long-form dialogue systems that leverage structured event reasoning.

Abstract

Large language models have improved dialogue systems, but often process conversational turns in isolation, overlooking the event structures that guide natural interactions. Hence we introduce \textbf{EventWeave}, a framework that explicitly models relationships between conversational events to generate more contextually appropriate dialogue responses. EventWeave constructs a dynamic event graph that distinguishes between core events (main goals) and supporting events (interconnected details), employing a multi-head attention mechanism to selectively determine which events are most relevant to the current turn. Unlike summarization or standard graph-based approaches, our method captures three distinct relationship types between events, allowing for more nuanced context modeling. Experiments on three dialogue datasets demonstrate that EventWeave produces more natural and contextually appropriate responses while requiring less computational overhead than models processing the entire dialogue history. Ablation studies confirm improvements stem from better event relationship modeling rather than increased information density. Our approach effectively balances comprehensive context understanding with generating concise responses, maintaining strong performance across various dialogue lengths through targeted optimization techniques.

EventWeave: A Dynamic Framework for Capturing Core and Supporting Events in Dialogue Systems

TL;DR

The paper tackles the challenge that dialogue models often treat turns in isolation, neglecting underlying event structures that guide natural interactions. It introduces EventWeave, a dynamic hierarchical event-graph framework that differentiates core goals from supporting details and uses a multi-head attention mechanism to selectively retrieve relevant events for each turn. The approach defines three edge types—sequential, temporal, and reasoning—to model nuanced event relationships, along with adaptive node preservation and a multi-perspective graph-retrieval strategy to guide response generation. Experiments on Conversation Chronicle, Multi-Session Chat, and LoCoMo show that EventWeave yields more natural and contextually appropriate responses while reducing computation through graph-based representations and pruning. Together, these results suggest a scalable route to more coherent, efficient long-form dialogue systems that leverage structured event reasoning.

Abstract

Large language models have improved dialogue systems, but often process conversational turns in isolation, overlooking the event structures that guide natural interactions. Hence we introduce \textbf{EventWeave}, a framework that explicitly models relationships between conversational events to generate more contextually appropriate dialogue responses. EventWeave constructs a dynamic event graph that distinguishes between core events (main goals) and supporting events (interconnected details), employing a multi-head attention mechanism to selectively determine which events are most relevant to the current turn. Unlike summarization or standard graph-based approaches, our method captures three distinct relationship types between events, allowing for more nuanced context modeling. Experiments on three dialogue datasets demonstrate that EventWeave produces more natural and contextually appropriate responses while requiring less computational overhead than models processing the entire dialogue history. Ablation studies confirm improvements stem from better event relationship modeling rather than increased information density. Our approach effectively balances comprehensive context understanding with generating concise responses, maintaining strong performance across various dialogue lengths through targeted optimization techniques.

Paper Structure

This paper contains 59 sections, 21 equations, 10 figures, 15 tables.

Figures (10)

  • Figure 1: An illustrative conversation between two friends planning to attend a music festival. Supporting events play a key role in enriching the dialogue's context and shaping the interpersonal dynamic.
  • Figure 2: EventWeave architecture illustrating the three main components: (1) Event Extraction Module that identifies core and supporting events from dialogue turns, (2) Dynamic Graph Construction that establishes relationship types between events, and (3) Context-Aware Response Generation that selectively retrieves relevant event nodes using a multi-head attention mechanism.
  • Figure 3: Memory efficiency comparison on LoCoMo. EventWeave achieves higher ROUGE-L scores with lower token consumption compared to other memory mechanisms.
  • Figure 4: Case study showing EventWeave's reasoning process across a multi-turn dialogue. The model identifies both core events (the missing and subsequent return of the cat) and supporting events (the cat's preference for tuna), then integrates them to generate a contextually appropriate response.
  • Figure 5: Event node retrieval performance with ICL on both Conversation Chronicle and MSC for LLaMA3.
  • ...and 5 more figures