Designing and Evaluating Dialogue LLMs for Co-Creative Improvised Theatre
Boyd Branch, Piotr Mirowski, Kory Mathewson, Sophia Ppali, Alexandra Covaci
TL;DR
This study investigates co-creative improvised theatre with multi-party dialogue between humans and LLM-powered agents in live performances. It deploys a participatory human-in-the-loop framework, featuring an Operator and a Curator to manage context and select AI lines across 26 Edinburgh Fringe shows, using three LLMs (GPT-3.5/4, PaLM 2, Llama 2). Through audience and actor surveys, logs, and post-hoc analysis, the work reveals nuanced perceptions of AI as a creativity partner rather than a fully autonomous performer, while highlighting technical and ethical challenges such as latency, turn-taking, and copyright concerns. The findings inform design principles for future live AI performances, emphasizing enhanced MPC capabilities, improved data provisioning, and human-centered interfaces to sustain engaging co-creative experiences in the arts.
Abstract
Social robotics researchers are increasingly interested in multi-party trained conversational agents. With a growing demand for real-world evaluations, our study presents Large Language Models (LLMs) deployed in a month-long live show at the Edinburgh Festival Fringe. This case study investigates human improvisers co-creating with conversational agents in a professional theatre setting. We explore the technical capabilities and constraints of on-the-spot multi-party dialogue, providing comprehensive insights from both audience and performer experiences with AI on stage. Our human-in-the-loop methodology underlines the challenges of these LLMs in generating context-relevant responses, stressing the user interface's crucial role. Audience feedback indicates an evolving interest for AI-driven live entertainment, direct human-AI interaction, and a diverse range of expectations about AI's conversational competence and utility as a creativity support tool. Human performers express immense enthusiasm, varied satisfaction, and the evolving public opinion highlights mixed emotions about AI's role in arts.
