Beyond the Binary: A nuanced path for open-weight advanced AI
Bengüsu Özcan, Alex Petropoulos, Max Reddel
TL;DR
The paper addresses the risks and opportunities of open-weight advanced AI by arguing that openness should be governed through a tiered, safety-anchored framework rather than a binary open/closed distinction. It analyzes the current landscape and governance gaps across major regions, highlighting inconsistent safety practices and the lack of universal standards. The authors propose actionable recommendations—standards, AI Safety Institutes, risk-aware release channels, technical safeguards, and societal preparedness—to enable responsible openness while mitigating misuse. The work targets developers, evaluators, standard-setters, and policymakers, aiming to align technical advancement with robust risk management and public resilience. Its significance lies in offering a concrete, multi-stakeholder path to harness the benefits of open-weight AI without compromising safety and security.
Abstract
Open-weight advanced AI models -- systems whose parameters are freely available for download and adaptation -- are reshaping the global AI landscape. As these models rapidly close the performance gap with closed alternatives, they enable breakthrough research and broaden access to powerful tools. However, once released, they cannot be recalled, and their built-in safeguards can be bypassed through fine-tuning or jailbreaking, posing risks that current governance frameworks are not equipped to address. This report moves beyond the binary framing of ``open'' versus ``closed'' AI. We assess the current landscape of open-weight advanced AI, examining technical capabilities, risk profiles, and regulatory responses across the European Union, United States, China, the United Kingdom, and international forums. We find significant disparities in safety practices across developers and jurisdictions, with no commonly adopted standards for determining when or how advanced models should be released openly. We propose a tiered, safety-anchored approach to model release, where openness is determined by rigorous risk assessment and demonstrated safety rather than ideology or commercial pressure. We outline actionable recommendations for developers, evaluators, standard-setters, and policymakers to enable responsible openness while investing in technical safeguards and societal preparedness.
