Table of Contents
Fetching ...

In Which Areas of Technical AI Safety Could Geopolitical Rivals Cooperate?

Ben Bucknall, Saad Siddiqui, Lara Thurnherr, Conor McGurk, Ben Harack, Anka Reuel, Patricia Paskov, Casey Mahoney, Sören Mindermann, Scott Singer, Vinay Hiremath, Charbel-Raphaël Segerie, Oscar Delaney, Alessandro Abate, Fazl Barez, Michael K. Cohen, Philip Torr, Ferenc Huszár, Anisoara Calinescu, Gabriel Davis Jones, Yoshua Bengio, Robert Trager

TL;DR

This paper examines how geopolitical rivals might safely cooperate on technical AI safety research. It analyzes historical patterns of strategic-technology cooperation, with a US-China AI case study across academia, industry, and intergovernmental channels, to identify risk-management gaps and viable collaboration areas. The authors argue that verification mechanisms and codified protocols are comparatively safer avenues for international cooperation, while infrastructure and evaluation activities pose higher risks of enabling harm or leaking sensitive information. The work provides a structured framework to help researchers and policymakers balance safety benefits with national security concerns, guiding future, more concrete policy and research directions for international AI safety collaboration.

Abstract

International cooperation is common in AI research, including between geopolitical rivals. While many experts advocate for greater international cooperation on AI safety to address shared global risks, some view cooperation on AI with suspicion, arguing that it can pose unacceptable risks to national security. However, the extent to which cooperation on AI safety poses such risks, as well as provides benefits, depends on the specific area of cooperation. In this paper, we consider technical factors that impact the risks of international cooperation on AI safety research, focusing on the degree to which such cooperation can advance dangerous capabilities, result in the sharing of sensitive information, or provide opportunities for harm. We begin by why nations historically cooperate on strategic technologies and analyse current US-China cooperation in AI as a case study. We further argue that existing frameworks for managing associated risks can be supplemented with consideration of key risks specific to cooperation on technical AI safety research. Through our analysis, we find that research into AI verification mechanisms and shared protocols may be suitable areas for such cooperation. Through this analysis we aim to help researchers and governments identify and mitigate the risks of international cooperation on AI safety research, so that the benefits of cooperation can be fully realised.

In Which Areas of Technical AI Safety Could Geopolitical Rivals Cooperate?

TL;DR

This paper examines how geopolitical rivals might safely cooperate on technical AI safety research. It analyzes historical patterns of strategic-technology cooperation, with a US-China AI case study across academia, industry, and intergovernmental channels, to identify risk-management gaps and viable collaboration areas. The authors argue that verification mechanisms and codified protocols are comparatively safer avenues for international cooperation, while infrastructure and evaluation activities pose higher risks of enabling harm or leaking sensitive information. The work provides a structured framework to help researchers and policymakers balance safety benefits with national security concerns, guiding future, more concrete policy and research directions for international AI safety collaboration.

Abstract

International cooperation is common in AI research, including between geopolitical rivals. While many experts advocate for greater international cooperation on AI safety to address shared global risks, some view cooperation on AI with suspicion, arguing that it can pose unacceptable risks to national security. However, the extent to which cooperation on AI safety poses such risks, as well as provides benefits, depends on the specific area of cooperation. In this paper, we consider technical factors that impact the risks of international cooperation on AI safety research, focusing on the degree to which such cooperation can advance dangerous capabilities, result in the sharing of sensitive information, or provide opportunities for harm. We begin by why nations historically cooperate on strategic technologies and analyse current US-China cooperation in AI as a case study. We further argue that existing frameworks for managing associated risks can be supplemented with consideration of key risks specific to cooperation on technical AI safety research. Through our analysis, we find that research into AI verification mechanisms and shared protocols may be suitable areas for such cooperation. Through this analysis we aim to help researchers and governments identify and mitigate the risks of international cooperation on AI safety research, so that the benefits of cooperation can be fully realised.

Paper Structure

This paper contains 27 sections, 1 figure, 1 table.

Figures (1)

  • Figure 1: AI safety co-authorship instances with American researchers (%). Incomplete data from 2023 and 2024 excluded. Data labels at the top of bars show the total number of AI safety papers by US researchers in that year. Data source: emerging_technology_observatory_country_2024. (Note that if a publication lists authors from organisations in more than one country (excluding the US), the publication will "count towards" multiple countries, and thus be represented multiple times in the figure.)