Botender: Supporting Communities in Collaboratively Designing AI Agents through Case-Based Provocations
Tzu-Sheng Kuo, Sophia Liu, Quan Ze Chen, Joseph Seering, Amy X. Zhang, Haiyi Zhu, Kenneth Holstein
TL;DR
Botender enables communities to co-create LLM-powered bots through a collaborative workflow that centers case-based provocations to surface disagreements and opportunities for improvement. The system integrates with Discord, supports proposing, iterating on prompts, and deploying updates, and relies on three LLM pipelines to generate provocative, context-rich test cases. A validation study shows that Botender’s provocations uncover more potential improvements and disagreements than baseline test cases, while a five-day field study demonstrates practical, culture-aligned bot design across six real communities. Together, these findings highlight how participatory design, integrated tooling, and provocative case generation can expand community governance of AI-backed infrastructures while pointing to future work on scalability, multi-turn agent design, and power dynamics.
Abstract
AI agents, or bots, serve important roles in online communities. However, they are often designed by outsiders or a few tech-savvy members, leading to bots that may not align with the broader community's needs. How might communities collectively shape the behavior of community bots? We present Botender, a system that enables communities to collaboratively design LLM-powered bots without coding. With Botender, community members can directly propose, iterate on, and deploy custom bot behaviors tailored to community needs. Botender facilitates testing and iteration on bot behavior through case-based provocations: interaction scenarios generated to spark user reflection and discussion around desirable bot behavior. A validation study found these provocations more useful than standard test cases for revealing improvement opportunities and surfacing disagreements. During a five-day deployment across six Discord servers, Botender supported communities in tailoring bot behavior to their specific needs, showcasing the usefulness of case-based provocations in facilitating collaborative bot design.
