An International Agreement to Prevent the Premature Creation of Artificial Superintelligence
Aaron Scher, David Abecassis, Peter Barnett, Brian Abeyta
TL;DR
The paper tackles the risk of premature artificial superintelligence by proposing an international, verifiable halt that postpones ASI until robust alignment and mitigation strategies exist. Its core approach combines FLOP-based training limits, comprehensive AI chip tracking, and targeted restricted research under a US–China-led coalition and a Coalition Technical Body, complemented by challenge inspections and whistleblower protections. It provides a concrete draft agreement with enforcement and verification mechanisms, plus a staged implementation framework to build political support while gradually expanding oversight. The work highlights the global governance implications of high-stakes AI, acknowledges tradeoffs, and argues that international coordination is essential to prevent a potentially civilization-threatening ASI race.
Abstract
Many experts argue that premature development of artificial superintelligence (ASI) poses catastrophic risks, including the risk of human extinction from misaligned ASI, geopolitical instability, and misuse by malicious actors. This report proposes an international agreement to prevent the premature development of ASI until AI development can proceed without these risks. The agreement halts dangerous AI capabilities advancement while preserving access to current, safe AI applications. The proposed framework centers on a coalition led by the United States and China that would restrict the scale of AI training and dangerous AI research. Due to the lack of trust between parties, verification is a key part of the agreement. Limits on the scale of AI training are operationalized by FLOP thresholds and verified through the tracking of AI chips and verification of chip use. Dangerous AI research--that which advances toward artificial superintelligence or endangers the agreement's verifiability--is stopped via legal prohibitions and multifaceted verification. We believe the proposal would be technically sufficient to forestall the development of ASI if implemented today, but advancements in AI capabilities or development methods could hurt its efficacy. Additionally, there does not yet exist the political will to put such an agreement in place. Despite these challenges, we hope this agreement can provide direction for AI governance research and policy.
