RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Kenan Li; Rongzhi Li; Linghao Zhang; Qirui Jin; Liao Zhu; Xiaosong Huang; Geng Zhang; Yikai Zhang; Shilin He; Chengxing Xie; Xin Zhang; Zijian Jin; Bowen Li; Chaoyun Zhang; Yu Kang; Yufan Huang; Elsie Nallipogu; Saravan Rajmohan; Qingwei Lin; Dongmei Zhang

TL;DR

We introduce RepoLaunch, the first agent capable of automatically resolving dependencies, compiling source code, and extracting test results for repositories across arbitrary programming languages and operating systems, enabling scalable benchmarking and training of coding agents and LLMs.

Abstract

Building software repositories typically requires significant manual effort. Recent advances in large language model (LLM) agents have accelerated automation in software engineering (SWE). We introduce RepoLaunch, the first agent capable of automatically resolving dependencies, compiling source code, and extracting test results for repositories across arbitrary programming languages and operating systems. To demonstrate its utility, we further propose a fully automated pipeline for SWE dataset creation, where task design is the only human intervention. RepoLaunch automates the remaining steps, enabling scalable benchmarking and training of coding agents and LLMs. Notably, several works on agentic benchmarking and training have recently adopted RepoLaunch for automated task generation.

Paper Structure (33 sections, 10 equations, 4 figures, 5 tables)

Introduction
Related Works
RepoLaunch
Problem Formulation
Preparation Stage
Build Stage
Release Stage
RepoLaunch-Automated SWE Dataset Creation
Experiment
Evaluation of RepoLaunch
SWE Task Creation from GitHub
Result
Task Creation of SWE-bench-Live / MultiLang & Windows
Evaluation Result of LLMs and Agents
Comparison with repo2run & SWE-agent
...and 18 more sections

Figures (4)

Figure 1: RepoLaunch Pipeline.
Figure 2: RepoLaunch-Automated SWE Dataset Creation. Note that the Cross-Platform Code Migration Bench is future work; it is also an SWE benchmark whose task instances are created by RepoLaunch.
Figure 3: The proportion of different failure patterns in the Build and Release stages.
Figure 4: The proportion of different failure patterns of coding agents on SWE-bench-Live.
