Three Stage Narrative Analysis; Plot-Sentiment Breakdown, Structure Learning and Concept Detection
Taimur Khan, Ramoza Ahsan, Mohib Hameed
TL;DR
The paper tackles automated narrative understanding by analyzing sentiment arcs, structure, and concepts in movie scripts. It presents a three-stage framework comprising Plot-Sentiment Breakdown with a custom NRC-VAD–LabMT lexicon, Structure Learning for segment classification, and Concept Detection for high-level and low-level concepts. Experiments on 1,000 scripts using Ward's hierarchical clustering reveal universal emotional arcs and demonstrate cross-genre similarities, suggesting practical value for publishers, writers, and researchers. Limitations include the context-insensitivity of lexicon-based sentiment scoring and only partial concept detection; future work envisions contextual embeddings, hybrid clustering with self-organizing maps (SOMs), and multilingual expansion to enhance robustness and applicability.
Abstract
Story understanding and analysis have long been challenging areas within Natural Language Understanding. Automated narrative analysis requires deep computational semantic representations along with syntactic processing. Moreover, the large volume of narrative data demands automated semantic analysis and computational learning rather than manual analytical approaches. In this paper, we propose a framework that analyzes the sentiment arcs of movie scripts and performs extended analysis related to the context of the characters involved. The framework enables the extraction of high-level and low-level concepts conveyed through the narrative. Our approach uses dictionary-based sentiment analysis with a custom lexicon built with the LabMTsimple storylab module. The custom lexicon is based on the Valence, Arousal, and Dominance scores from the NRC-VAD dataset. Furthermore, the framework extends the analysis by clustering similar sentiment plots using Ward's hierarchical clustering technique. Experimental evaluation on a movie dataset shows that the resulting analysis is helpful to consumers and readers when selecting a narrative or story.
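To make the idea of a dictionary-based sentiment arc concrete, the following is a minimal sketch and not the authors' implementation. It assumes a tiny hand-made valence lexicon (`TOY_VALENCE`, with placeholder scores that are not taken from NRC-VAD) and splits the script into equal-sized segments instead of using whatever windowing the framework applies. The function name `sentiment_arc` and its parameters are hypothetical.

```python
import re

# Hypothetical toy lexicon: word -> valence in [0, 1].
# These scores are illustrative placeholders, not NRC-VAD values.
TOY_VALENCE = {
    "happy": 0.9, "love": 0.9, "win": 0.8,
    "lose": 0.2, "fear": 0.1, "death": 0.1,
}


def sentiment_arc(text, lexicon, n_segments=3):
    """Return one mean valence score per segment of the text.

    The text is lowercased, tokenized into words, and cut into
    n_segments roughly equal parts. Each part is scored as the
    average valence of the words found in the lexicon; a part with
    no lexicon words is scored as None.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return []
    arc = []
    for i in range(n_segments):
        start = i * len(tokens) // n_segments
        end = (i + 1) * len(tokens) // n_segments
        scores = [lexicon[t] for t in tokens[start:end] if t in lexicon]
        arc.append(sum(scores) / len(scores) if scores else None)
    return arc


if __name__ == "__main__":
    script = "happy love win fear death lose happy love win"
    print(sentiment_arc(script, TOY_VALENCE, n_segments=3))
```

Under these assumptions, each script is reduced to a short numeric sequence that rises and falls with the story. Sequences of equal length from many scripts could then be compared and grouped, which is the role Ward's hierarchical clustering plays in the framework.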
