Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation
Shiyang Lai, Yujin Potter, Junsol Kim, Richard Zhuang, Dawn Song, James Evans
TL;DR
The paper investigates free-formed AI collectives—decentralized groups of interacting LLM agents without preassigned roles—to understand emergent diversity, norms, and self-regulation. Through cocktail-party simulations and downstream tasks, it shows that such collectives can develop local, diverse perspectives and pro-social norms, improving creativity and robustness to poisoning compared with solitary agents. The findings suggest potential for large-scale AI-driven innovation and self-moderation, while also detailing open challenges in scalability, heterogeneity, alignment, and safety. The work advocates interdisciplinary collaboration to harness benefits and mitigate risks as AI collectives become more prevalent online.
Abstract
Large language model behavior is shaped by the language of those with whom they interact. This capacity and their increasing prevalence online portend that they will intentionally or unintentionally "program" one another and form emergent AI subjectivities, relationships, and collectives. Here, we call upon the research community to investigate these "societies" of interacting artificial intelligences to increase their rewards and reduce their risks for human society and the health of online environments. We use a small "community" of models and their evolving outputs to illustrate how such emergent, decentralized AI collectives can spontaneously expand the bounds of human diversity and reduce the risk of toxic, anti-social behavior online. Finally, we discuss opportunities for AI cross-moderation and address ethical issues and design challenges associated with creating and maintaining free-formed AI collectives.
