Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models
Paramveer S. Dhillon, Somayeh Molaei, Jiaqi Li, Maximilian Golub, Shaochun Zheng, Lionel P. Robert
TL;DR
This study investigates how varying AI scaffolding levels in co-writing with a large language model affect writing quality, productivity, and user experience. Using a within-subject, Latin-square field experiment (N=$131$) with three conditions (no AI, next-sentence, next-paragraph) and a custom GPT-3-based tool, the authors reveal a U-shaped effect: high-level paragraph scaffolding substantially improves quality and speed, especially for non-regular and less tech-savvy writers, while low-level sentence scaffolding can reduce quality and ownership. The results also show that user satisfaction and sense of authorship decline under scaffolded conditions, underscoring the need for personalized, adaptive scaffolding that preserves human agency. The findings offer concrete guidance for designing AI writing assistants that enhance output without eroding engagement, suggesting dynamic, user-aware scaffolding as a key direction for productive human-AI collaboration in writing.
Abstract
Advances in language modeling have paved the way for novel human-AI co-writing experiences. This paper explores how varying levels of scaffolding from large language models (LLMs) shape the co-writing process. Employing a within-subjects field experiment with a Latin square design, we asked participants (N=131) to respond to argumentative writing prompts under three randomly sequenced conditions: no AI assistance (control), next-sentence suggestions (low scaffolding), and next-paragraph suggestions (high scaffolding). Our findings reveal a U-shaped impact of scaffolding on writing quality and productivity (words/time). While low scaffolding did not significantly improve writing quality or productivity, high scaffolding led to significant improvements, especially benefiting non-regular writers and less tech-savvy users. No significant cognitive burden was observed while using the scaffolded writing tools, but a moderate decrease in text ownership and satisfaction was noted. Our results have broad implications for the design of AI-powered writing tools, including the need for personalized scaffolding mechanisms.
