Why Human Guidance Matters in Collaborative Vibe Coding
Haoyu Hu, Raja Marjieh, Katherine M Collins, Chenyi Li, Thomas L. Griffiths, Ilia Sucholutsky, Nori Jacoby
TL;DR
The paper investigates how human guidance shapes collaborative vibe coding, a process where high-level explanations steer AI to iteratively generate and refine SVG outputs. It introduces a controlled experimental framework to compare human-led, AI-led, and hybrid configurations across 16 experiments with 604 participants, finding that humans provide durable, high-level guidance while AI-led instructions often collapse over iterations. Hybrid setups with humans directing and AI evaluating and executing yield the best performance, while pure AI guidance degrades. These results reveal misalignment in language use between humans and AI and offer practical design principles for scalable, human-centered AI collaboration beyond programming.
Abstract
Writing code has been one of the most transformative ways for human societies to translate abstract ideas into tangible technologies. Modern AI is transforming this process by enabling experts and non-experts alike to generate code without actually writing code, but instead, through natural language instructions, or "vibe coding". While increasingly popular, the cumulative impact of vibe coding on productivity and collaboration, as well as the role of humans in this process, remains unclear. Here, we introduce a controlled experimental framework for studying collaborative vibe coding and use it to compare human-led, AI-led, and hybrid groups. Across 16 experiments involving 604 human participants, we show that people provide uniquely effective high-level instructions for vibe coding across iterations, whereas AI-provided instructions often result in performance collapse. We further demonstrate that hybrid systems perform best when humans retain directional control (providing the instructions), while evaluation is delegated to AI.
