Table of Contents
Fetching ...

DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering

Aritra Kumar Lahiri, Qinmin Vivian Hu

TL;DR

DragonVerseQA addresses the need for context-rich open-domain long-form QA in serialized TV narratives by building a multi-source dataset that combines episode summaries, audience reviews, and structured data. It deploys a context-aware QA pipeline that uses zero-shot GPT-3 summarization for contextual inputs, BLANC for answer-span prediction, and AAQG for generating long-form questions, with spam and bias filtering to ensure quality. The dataset contains 3200 QA pairs across multiple episodes and is linked to a supportive knowledge graph, enabling narrative reasoning and sentiment analysis; it is evaluated against open benchmarks and through ablation studies to demonstrate the value of multi-source context and hierarchical decoding. The work advances narrative understanding and opens pathways for richer AI-driven interactions in entertainment domains, including improved conversational agents and more nuanced content analysis.

Abstract

This paper proposes a novel approach to develop an open-domain and long-form Over-The-Top (OTT) Question-Answering (QA) dataset, DragonVerseQA, specifically oriented to the fantasy universe of "House of the Dragon" and "Game Of Thrones" TV series. Most existing QA datasets focus on short, fact-based answers sourced almost solely from Wikipedia articles, devoid of depth and contextual richness for sophisticated narrative understanding. We curate a dataset that combines full episode summaries sourced from HBO and fandom wiki websites, user reviews from sources like IMDb and Rotten Tomatoes, and high-quality, open-domain, legally admissible sources, and structured data from repositories like WikiData into one dataset. The dataset provides a multi-dimensional context, reflecting complex character dynamics and plot developments from these varied sources. That means, on equal footing, only after heavy data preprocessing and filtering methods will meaningful, non-spam unbiased reviews be available in this enriched dataset. The comprehensive insights are given through the long-form answers generated from this enriched context. This is what makes this valuable dataset for improving conversational AI, narrative analysis, sentiment analysis, summarization techniques, and relation extraction. A comparative analysis with state-of-the-art QA datasets such as SQuAD 2.0, TriviaQA, and Natural Questions brings to light the unique advantages of our dataset in terms of contextual complexity and answer length. Detailed reviews add layers to audience sentiment and narrative interpretation, raising the bar for domain-specific QA with a new quality benchmark. Our work also allows a deeper understanding of entertainment-industry content and opens the door to more knowledgeable and creative AI-driven interactions within digital media environments.

DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering

TL;DR

DragonVerseQA addresses the need for context-rich open-domain long-form QA in serialized TV narratives by building a multi-source dataset that combines episode summaries, audience reviews, and structured data. It deploys a context-aware QA pipeline that uses zero-shot GPT-3 summarization for contextual inputs, BLANC for answer-span prediction, and AAQG for generating long-form questions, with spam and bias filtering to ensure quality. The dataset contains 3200 QA pairs across multiple episodes and is linked to a supportive knowledge graph, enabling narrative reasoning and sentiment analysis; it is evaluated against open benchmarks and through ablation studies to demonstrate the value of multi-source context and hierarchical decoding. The work advances narrative understanding and opens pathways for richer AI-driven interactions in entertainment domains, including improved conversational agents and more nuanced content analysis.

Abstract

This paper proposes a novel approach to develop an open-domain and long-form Over-The-Top (OTT) Question-Answering (QA) dataset, DragonVerseQA, specifically oriented to the fantasy universe of "House of the Dragon" and "Game Of Thrones" TV series. Most existing QA datasets focus on short, fact-based answers sourced almost solely from Wikipedia articles, devoid of depth and contextual richness for sophisticated narrative understanding. We curate a dataset that combines full episode summaries sourced from HBO and fandom wiki websites, user reviews from sources like IMDb and Rotten Tomatoes, and high-quality, open-domain, legally admissible sources, and structured data from repositories like WikiData into one dataset. The dataset provides a multi-dimensional context, reflecting complex character dynamics and plot developments from these varied sources. That means, on equal footing, only after heavy data preprocessing and filtering methods will meaningful, non-spam unbiased reviews be available in this enriched dataset. The comprehensive insights are given through the long-form answers generated from this enriched context. This is what makes this valuable dataset for improving conversational AI, narrative analysis, sentiment analysis, summarization techniques, and relation extraction. A comparative analysis with state-of-the-art QA datasets such as SQuAD 2.0, TriviaQA, and Natural Questions brings to light the unique advantages of our dataset in terms of contextual complexity and answer length. Detailed reviews add layers to audience sentiment and narrative interpretation, raising the bar for domain-specific QA with a new quality benchmark. Our work also allows a deeper understanding of entertainment-industry content and opens the door to more knowledgeable and creative AI-driven interactions within digital media environments.

Paper Structure

This paper contains 22 sections, 8 equations, 8 figures, 12 tables, 4 algorithms.

Figures (8)

  • Figure 1: DragonVerseQA - Dataset structure
  • Figure 2: Filtered User Reviews
  • Figure 3: Contextual Data Integration
  • Figure 4: Context-Aware Question Answering: Architecture
  • Figure 5: Ambiguity Resolution during Question Generation
  • ...and 3 more figures