Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding
Lei Huang, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen
TL;DR
Ex3 tackles long-form novel generation by learning from raw novels rather than relying solely on prompt engineering. It introduces a three-stage framework—Extracting, Excelsior, and Expanding—that first derives hierarchical structure and entity information from texts, then fine-tunes an instruction-following LLM on a structure-informed corpus, and finally expands premises into arbitrarily long narratives via a depth-first, tree-like generation process. The approach yields higher-quality long-form novels than prior hierarchical methods, demonstrated through comprehensive human evaluations and automation metrics, and shows robust performance across medium- and long-length stories. The framework embodies a self-improvement loop by using summarization to train the model, reducing reliance on hand-crafted prompts and enabling controllable, genre-aligned writing with potential for multi-language and interactive generation in future work.
Abstract
Generating long-term texts such as novels using artificial intelligence has always been a challenge. A common approach is to use large language models (LLMs) to construct a hierarchical framework that first plans and then writes. Despite the fact that the generated novels reach a sufficient length, they exhibit poor logical coherence and appeal in their plots and deficiencies in character and event depiction, ultimately compromising the overall narrative quality. In this paper, we propose a method named Extracting Excelsior and Expanding. Ex3 initially extracts structure information from raw novel data. By combining this structure information with the novel data, an instruction-following dataset is meticulously crafted. This dataset is then utilized to fine-tune the LLM, aiming for excelsior generation performance. In the final stage, a tree-like expansion method is deployed to facilitate the generation of arbitrarily long novels. Evaluation against previous methods showcases Ex3's ability to produce higher-quality long-form novels.
