Risks and Opportunities of Open-Source Generative AI
Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Phillip H. S. Torr, Trevor Darrell, Yong Lee, Jakob Foerster
TL;DR
This paper assesses the risks and opportunities of open-source generative AI through a three-stage development framework (near, mid, long-term) and a detailed openness taxonomy. It argues that open sourcing offers net benefits across research, safety, equity, and social impact, while acknowledging certain risks that require system-level safeguards and policy guidance. The authors map regulatory landscapes (EU AI Act, Biden EO, Chinese measures, Middle East initiatives) and analyze current LLM openness, highlighting that weights/data remain largely closed and performance gaps exist. They then present practical recommendations for policy, best practices, and risk mitigation, underscoring that openness—coupled with responsible governance—can enhance transparency, inclusivity, and safety while enabling decentralized innovation and coordination. Overall, the work advocates for permissive open-source practices complemented by voluntary, proactive safety and transparency measures to maximize societal benefits of Gen AI.
Abstract
Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation is likely to put at risk the budding field of open-source generative AI. Using a three-stage framework for Gen AI development (near, mid and long-term), we analyze the risks and opportunities of open-source generative AI models with similar capabilities to the ones currently available (near to mid-term) and with greater capabilities (long-term). We argue that, overall, the benefits of open-source Gen AI outweigh its risks. As such, we encourage the open sourcing of models, training and evaluation data, and provide a set of recommendations and best practices for managing risks associated with open-source generative AI.
