SWAG: Storytelling With Action Guidance
Zeeshan Patel, Karim El-Refai, Jonathan Pei, Tianle Li
TL;DR
SWAG addresses the challenge of engaging long-form storytelling by introducing a two-model feedback loop in which an action-discriminator LLM guides narrative direction and a story-generation model writes content. The AD LLM is trained via supervised fine-tuning and direct preference optimization on a GPT-4–generated action dataset, and the SWAG loop alternates between content generation and action selection to produce longer, more engaging narratives. Empirical results from both machine and human evaluations show SWAG substantially outperforms end-to-end generation baselines and can even surpass GPT-3.5-Turbo in several setups, while remaining compatible with open-source models through LongLoRA-based long-context fine-tuning. The proposed framework is modular and extensible, enabling fine-grained control over story progression and potential extensions such as test-time action generation or human-in-the-loop collaboration for diverse storytelling applications.
Abstract
Automated long-form story generation typically employs long-context large language models (LLMs) for one-shot creation, which can produce cohesive but not necessarily engaging content. We introduce Storytelling With Action Guidance (SWAG), a novel approach to storytelling with LLMs. Our approach frames story writing as a search problem through a two-model feedback loop: one LLM generates story content, and another auxiliary LLM is used to choose the next best "action" to steer the story's future direction. Our results show that SWAG can substantially outperform previous end-to-end story generation techniques when evaluated by GPT-4 and through human evaluation. Our SWAG pipeline using only small open-source models surpasses GPT-3.5-Turbo.
